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fK| (54) Title: METHOD FOR DETECTING PRE-DISPOSmON TO HEPAT0TOXI(3TY 
IT) 

vo 

^ (57) Abstract: This invention relates to a method for diagnosing a pre-disposition to drug induced livertoxicity which method com- 
^ prises determining at least one single nucleotide polymorphism in the UDP-glucuronosyl transferase (UGTl) gene. Said method is 
2 based on determining specific single nucleotide polymorphisms in the UGTl gene in a human being and determining the status of 
said human being by reference to polymoiphism in UGTl . The invention further relates to diagnostic nucleic acids comprising within 
^ their sequence the polymorphisms as defined herein, to allele-specific primers and allele-specific oligonucleotide probes capable of 
^ hybridizing to such diagnostic nucleic acids and to diagnostic kits comprising one or more of such primers and probes for detecting 
^ a polymorphism in the UGTl gene. 



PCT/EPOl/07524 

-1- 
Case 20707 

Method for detecting pre-disposition to hepatotoxicity 

The present invention relates to i method for diagnosing a pre-disposition to drug 
induced livertoxicity which method comprises determining the polymorphisms in the 
UDP-glucuronosyl transferase (UGT 1 ) gene. 

UGTl is a member of the UDP glucuronosyltransferase (UGT) gene superfamily. 

5 UGT enzymes catalyze the addition of the glucuronosyl group from a nucleotide sugar to a 
small hydrophobic molecule (aglycone) in order to enhance the water solubility of endo- 
and xenobiotics. UGT enzymes are involved in the metabolism of a large number of drugs. 
For a review on this enzyme superfamily see Pharmacogenetics (1997) 7, 255-269. The 
presence of at least nine UDP-^ucuronosyl transferase isoenzymes has been described in 

10 International patent application WO 92/12987. 

One of these enzymes, the human bilirubin glucuronosyl transferase gene encoded at 
the UGTl locus has been associated with gene defects by Ritter et al., J. Clinl Invest (1992), 
20, 150-155; Aomoo et al., Biochem. Biophys. Res. Commun. (1993), 12Z, 1239-1244; 
Moghrabi et al.. Am. J. Hum. Genet (1993), 53, 722-729; Labrune et al., Hum. Genet 
15 (1994), 94> 693-697 and Seppeu et al, J. Clin. Invest (1994), 26S, 2385-2391. 

The term polymorphism relates to the observation that different nucleotides can 
occur at a given position in a specific DNA sequence. Genetic polymorphisms occur at 
random throughout the genome. Genetic polymorphisms may affect the function of a gene 
by altering the structure of the protein that the gene codes or by affecting the level of 
20 expression of that gene. 

Genetic variations or polymorphisms among individuals are responsible to a great 
extent for the observable biological differences between individuals. Genetic variations are 
also responsible for the differences on how individuals respond to a drug. 

In vitro experiments have shown that variations in UGT1A6 and UGT1A7 genes 
25 affect the enzymatic activity on specific substrates of the enzymes coded by these genes. 

• Genetic polymorphisms in the human UGTl A6 (plasma phenol) UDP-glucuronosyl 
transferase and the pharmacological implications thereof have been described by Ciotti et 
al.. Pharmacogenetics ( 1997), 2, 485-495. The cloned and isolated UGTIAS*! allelic 
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variant (which contains the ThrlSlAla and Arg 184Ser mutations) when ejqjressed in COS 
cells metabolised, at pH 6.4, the substrates 4-nitrophenol, 4-tert-butylphenol, 3-ethyi- 
phenol, 4-eth)dphenol, 4-hydroxycoumarin, butylated hydroxy anisole and butylated 
hydroxy toluene at only 27-75% of the rate of the wild-type isoenzyme. l-Naphtol> 3- 
5 iodophenol, 7~hydroxycoumarin, and 7-hydroxy-4-methylcoumarin were metabolised at 
normal levels. 3-0-Methyl-dopa and ntethyl salicylate were metabolised at 41-74% and P- 
blockers at 28-69% of the rate of the wild-type isoenzyme. 

Guillemete et al. Pharmacogenetics (2000), 10, 629-644, showed that three different 
cloned and isolated allelic variants of UGTlA7y when expressed in HEK cells, showed 

10 different catalytic activity towards the substrates 3-> 7- and 9-hydroxy-ben2o-(a>-pyrene as 
compared to the wild-type enzyme. UGT1A7*3 (Lys 129 Lysl31 Arg208) exhibited 5.8-fold 
lower Vmax relative to the wild-type UGT1A7*! (Asnl29 Argl3 1 Trp208), whereas 
UGT1A7'^2 (Lysl29 Lysl31 Trp208) and UGrJA7*4 (Asnl29 Argl31 Arg208) had a 2.6 
and 2.8-fold lower relative Vnax than UGTIAT*!. While the results mentioned above 

15 indicate that genetic variations in the UGTl genes, when doned and expressed in cultured 
cells, can afifect the enzymatic activity of the corresponding gene products on a particular 
substrate, they show no proof that these variations have a significant effect on the 
metabolism of these substrates in human individuals. 

It has now been foimd that specific genetic variations within the UGTl gene 
20 complex, including UGT1A6 and UGT1A7, affect the response of individual human 
patients to a drug which is metabolised by these enzymes. 

Pharmacogenetics is an approach to use the knowledge of polymorphisms to study 
the role of genetic variation among individuals in variation to drug response, a variation 
that often results from individual diflferences in drug metaboUsm. Pharmacogenetics helps 

25 to identify patients most suited to therapy with particular pharmaceutical agents. This 
approach can be used in pharmaceutical research to assist the drug selection process. 
Polymorphisms are used in mapping the human genome and to elucidate the genetic 
component of diseases. Details on pharmacogenetics and other uses of polymorphism 
detection can be found in Linder et aL (1997), Clinical Chemistry, 43> 254; Marshall 

30 (1997), Nature Biotechnology, 15, 1249; International Patent Application WO 97/40462, 
Spectra Biomedical; and Schafer et al. (1998), Natmre Biotechnology, 16, 33. 

As particular mutations or polymorphisms associated with certain clinical features, 
such as adverse or abnormal events, are likely to be of low frequency within the population, 
low frequency SNPs may be particularly useful in identifying these mutations (for 
35 examples see: Linkage disequiUbrium at the cystathionine beta synthase (CBS) locus and 
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the association between genetic variation at the CBS locus and plasma levels of 
homocysteine (De Stefano et al., Ann. Hmn. Genet. (1998) 62, 481-90). Variation at the 
von Willebrand fector (vWF) gene locus is associated with plasma vWF: Ag levels: 
identification of three novel single nucleotide polymorphisms in the vWF gene promoter 
5 (Keightley et al.> Blood (1999) 93, 4277-83). 

It has been found that in rare cases the administration of pharmaceutically active 
agents to human beings leads to hepatotoxidty. A typical example is the occurrence of 
reversible asymptomatic increase in liver transaminases activity found in certain patients 
with Parkinson^s disease (PD) who had participated in dinical trials for tolcapone. The 

10 studies indicated that in rare circumstances, tolcapone could induce a reversible 
asymptomatic increase in liver transaminase activity. There was therefore a desire to 
establish whether there is a correlation between the occurrence of such liver abnormalities, 
which are indicators of liver toxicity and certain genetic pre- dispositions. It has now been 
found that variations in genes involved in tolcapone (TASMAR) metabolism and 

15 pharmacology cause abnormalities in metabolic activity in certain individuals, resulting in 
an accumulation in the liver of this drug or its metabolites to toxic levels. 

Clinical trials have shown that patient response to treatment with pharmaceuticals is 
often heterogeneous. Thus there is also a need for improved approaches to pharmaceutical 
agent design and therapy. 

20 The present invention therefore provides a genetic diagnostic tool for identifying the 

pre-disposing genotypes. Said tool consists of a method for detecting a predisposition to a 
hepatotoxic reaction caused by the administration of a pharmaceutically active compound 
to a human being based on the determination of at least one single nucleotide 
polymorphism in the UDP-^ucuronosyltransferase (UGTl) gene in the sample of said 

25 human being, which method comprises determining the nucleotide at position 90S in exon 
5 of the UGTl gene as defined by the position in SEQ ID NO:l and determining the status 
of the human being by reference to polymorphism in UGTl. Alternatively or, in addition 
thereto, the method comprises determining the sequence of the nucleic acid of the human 
being at position 528 in exon 1 of the UGT1A6 gene as defined by SEQ ID NO:2 or 

30 determing the sequence of the nucleic add of the human being at position 197 in exon I of 
the UGTl A7 gene as defined by sequence ID NO:3 and determining the status of said 
human being by reference to polymorphism in UGTl. 

An individual possesses a predisposition to a hepatotoxic reaction when his UGTl 
gene contains variations which lead to abnormalities in its metabolic activity. 
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SEQ ID N0:1 refers to Genbank accession number M84124, which provides exon 5 
ofUGTl: 

1 TAATTCCAGC TACTCTGGA6 GCTGAGGCAG GAGGATGGCT TGAGCCCAGG 

5 

51 AGTTGGAGGC TGCAGTTAGC CATGCTTGTG CCACTACACT CCAGCCCGGG 

I 

101 GAACAGGGCA AGACTCTGTA TCTAAAAACA ACAACAACAA CAATAATAGA 

10 . 151 AACAGGTTTC CTTTCCCAAG TTTGGAAAAT CTGGTAGTCT TCTTAAGCAG 

201 CCATGAGCAT AAAGAGAGGA TTGTTCATAC CACAGGTGTT CCAGGCATAA 

251 CGAAACTGTC TTTGTGTTTA GTTACAAGGA GAACATCATG CGCCTCTCCA 

15 

301 GCCTTCACAA GGACCGCCCG GTGGAGCCGC TGGACCTGGC CGTGTTCTGG 

351 6TGGAGTTT6 TGATGAGGCA CAAGGGCGC6 CCACACCTGC GCCCCGCAGC 

20 401 CCACGACCTC ACCTGGTACC AGTACCATTC CTTGGACGTG ATTGGTTTCC 

451 TCTTGGCCGT CGTGCTGACA GTGGCCTTCA TCACCTTTAA ATGTTGTGCT 

501 TATGGCTACC GGAAATGCTT GGGGAAAAAA GGGCGAGTTA AGAAAGCCCA 

25 

551 CAAATCCAAG ACCCATTGAG AAGTGGGTGG GA7ATAAGGT AAAATTTTGA 

601 .ACCATTCCCT AGTCATTTCC AAACTTGAAA ACAGAATCAG TGTTAAATTC 

30 651 ATTTTATTCT TATTAAGGAA ATACTTTGCA TAAATTAATC AGCCCCAGAG 

701 TGCTTTAAAA AATTCTCTTA AATAAAAATA ATAGACTCGC TAGTCA6TAA 

751 A6ATATTTGA ATATGTATCG TGCCCCCTCT GGTGTCTTTG ATCAGGATGA 

35 

801 CATOTGCCAT TTTTCAGAGG ACGTGCAGAC AGGCTGGCAT TCTAGATTAC 

851 TTTTCTTACT CTGAAACAT6 GCCTGTTTGG GAGTGCGGGA TTCAAAGGTG 

40 901 GTCCCACGGC TGCCCCTACT GCAAATGGCA GTTTTAATCT TATCTTTTGG 

951 CTTCTGCAGA TGGTTGCAAT TGATCCTTAA CCAATAATGG TCAGTCCTCA 

1001 TCTCTGTCGT GCTTCATAGG TGCCACCTTG TGTGTTTAAA GAAGGGAAGC 

45 

1051 TTTGTACCTT TAGAGTGTA6 6TGAAAT6AA TGAATGGCTT G6AGTGCACT 

1101 GAGAACAGCA TAT6ATTTCT TGCTTTGGG6 AAAAAGAAT6 ATGCTATGAA 

50 1151 ATTGGTGGGT GGTGTATTT6 AGAAGATAAT CATTGCTTAT GTCAAATGGA 

1201 GCTGAATTTG ATAAAAACCC AAAATACAGC TATGAAGTGC TGGGCAAGTT 

1251 TACTTTTTTO CTGATGTTTC CTACAACTAA AAATAAATTA ATAAATTTAT 

1301 ATAAATTCTA TTTAAGTGTT TTCACTGGTG TCGCATTTAT TTCTTGTTAA 



55 
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1351 GTTGCATTTT CTAATTACAA AAGTAATGCA TGATTATGAC AGAAACTTTG 
1401 GAAAATATAG AGGTTCACAC ACACACGCCT TCATTGCGTG TGCATGCATA 
1451 AATGCATGAG AAAAGAAAAA TAACCAGTAA TCACATCGCC CAGAAATAAC 
1501 CCCAGTTACA ATTGTGGCAA ATACACATAC TTATAAATAT TGCAGATATA 
1551 TTAAGTATAC C I 

The present invention is based on the discovery of novel sin^e nucleotide 
polymorphisms (SNP) in the UGTl gene locus» viz. 

at position 908 of SEQ ID NO:l. The polymorphism at this position consists of the 
replacement of the nucleotide G at this position by a C in exon 5 of the UGTI gene locus; 

15 at position 528 of SEQ ID NO:2. The polymoiphism at thi&position consists of the 

replacement of the nucleotide A at this position by a G in exon 1 of the UGTl A6 gene; 

and at position 197 of SEQ ID NO:3. The polymorphism at this position consists of 
the replacement of the nucleotide C at this position by a G in exon 1 of the UGTl A7 gene. 

As defined herein, the UGTl gene includes exon coding sequences for all different 
20 UGTIA isozymes, intron sequences intervening the exon sequences and 3' and 5' 

untranslated region (3' UTR and 5* UTR) sequences, including the promoter element of 
the UGTl gene, encoding for all UGTIA isozymes. 

SEQ ID NO:2 refers to Genbank accession number M84130, which provides exon 1 
ofUGTlA6: 

25 

1 TGACACGGCC ATAGTTGGTT CATATTAACC ATGTGATTAA AAT6GTTAAA 

51 TATTAATTTG GGTTCTTACA TATCAAAGGG TAAAATTCAG AGCAAGGGA6 

30 101 AGGTAGACAG GACCTGTGAA AAGCAGTGGT TAGTTTAGGG AAAATACCTA 

151 GGAGCCCTGT 6ATTTGGAGA GTGAAAACTC TTTATTACCG TTGTTACTTT 

201 AACTCTTTCC AGGATGGCGT GCCTCCTTCG CTCATTTCAG AGAATTTCTG 

35 

251 CAGGGGTTTT CTTCTTAGCA CTTTGGGGCA TGGTTGTAGG TGACAAGCTG 

301 CTGGTGGTCC CTCAGGACGG AAGCCACTGG CTTAGTATGA AGGATATAGT 

40 3 51 TGAGGTTCTC AGTGACCGGG GTCATGAGAT TGTAGTGGTG GTGCCTGAAG 

401 TTAATTTGCT TTTGAAAGAA TCCAAATACT ACACAAGAAA AATCTATCCA 

451 GTGCCGTATG ACCAAGAAGA GCTGAAG7VAC CGTTACCAAT CATTTGGAAA 
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501 CAATCACTTT GCTGAGCGAT CATTCCTAAC TGCTCCTCAG ACAGAGTACA 

551 GGAATAACAT GATT6TTATT GGCCTGTACT TCATCAACTG CCAGAGCCTC 

5 

601 CTGCAGGACA GGGACACCCT GAACTTCTTT AAGGAGAGCA AGTTTGATGC 

651 TCTOTrCACA 6ACCCAGCCT TACCCTGTGG GGT6ATCCTG GCTGAGTATT 

I 

10 701 TGGGCCTACC ATCTGTGTAC CTCTTCAG6G GTTTTCCGT6 TTCCCTGGAG 

751 CATACATTCA GCAGAAGCCC AGACCCTGTG TCCTACATTC CCAGGTGCTA 

801 CACAAAGTTT TCAGACCACA TGACTTTTTC CCAACGA6TG GCCAACTTCC 

15 

851 TTGTTAATTT GTTGGAGCCC TATCTATTTT ATTGTCTGTT TTCAAAGTAT 

901 GAAGAACTCG CATCAGCTGT CCTCAAGA6A GATGTGGATA TAATCACCTT 

20 951 ATATCAGAAG GTCTCTGTTT GGCTGTTAAG ATAT6ACTTT GTGCTT6AAT 

1001 ATCCTAGGCC GGTCATGCCC AACATGGTCT TCATTGGAGG TATCAACTCT 

1051 AAGAAGAGGA AAGACTT6TC TCAGGTTGGT GGGTTTATTT CTTTTGGACT 

25 

1101 GCCTTGTTTC TTCCAGGCTC TGTCCTCCCT CACTCATTTG GCTCCTTGAG 

1151 CCGACTGTCC CTTGGAGGAT TTCCTGGAGA ACGGTGGGGG GAAGTGATAC 

30 1201 CCGGCTCGGA GCAGCGGGAA 

SEQ ID NO:3 refers to Genbank accession number U39570, which provides exon 1 
ofUGTlA7: 

35 1 TGTATTATTA TGAGTAAATC ATTGGCAGTG AATGT6AATT TTTTTTTAAA 

51 TGAATGAATA AGTACACGCC TTCTTTTGAG GGCAGGTTCT ATCTGTACTT 

101 CTTCCACTTA CTATATTATA GGAGCTTAGA ATCCCAGCTG CTGGCTCTG6 

40 

151 GCTGAAGTTC TCT6ATGGCT CGTGCAGGGT GGACTGGCCT CCTTCCCCTA 

201 TATGTGTGTC TACTGCTGAC CTGTGCTTTG CCAAQGTCAG GGAAGCTGCT 

45 251 GGTAGTGCCC ATGGATGGGA GCCACTGGTT CACCATGCAG TCGGTGGTGG 

301 AGAAACTCAT CCTCAGGGGG CATGAGGTGG TCGTAGTCAT GCCAGAGGTG 

351 AGTTGGCAAC TGGGAAGATC ACTGAATTGC ACAGTGAAGA CTTACTCAAC 

50 

401 CTCATACACT CTGQAGGATC AG6ACCGGGA. GTTCATGGTT TTTGCCGATG 

451 CTCGCTGGAC GGCACCATT6 CGAAGTGCAT TTTCTCTATT AACAAGTTCA 

55 501 TCCAATGGTA TTTTTGACTT ATTTTTTTCA AATTGCAGGA GTTTGTTTAA 

551 TGACCGAAAA TTAGTAGAAT ACTTAAAG6A GAGTT6TTTT GATGCAGTGT 
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15 



20 



25 



30 



35 



40 



601 TTCTCGATCC TTTTGATCGC TGTGGCTTAA 

651 CTCCCCTCTG TGGTCTTCGC CAGGGGAATA 

701 AGGTGCACAG TGCCCTGCTC CTCTTTCCTA 

751 GGTTCTCA6A CGCCATGACT TTCAAGGA6A 

I 

801 CACTTGGAGG AACATTTATT TTGCCCCTAT 

851 AATAGCCTCT GAAATTCTCC AAACCCCTGT 

901 GCCACACATC AATTTGGTTG TTGCGAACT6 

951 AAACCCGTGA TGCCCAATAT GATCTTCATT 

1001 GGGAAAGCCA CTGCCTATGG TAAGTTATCT 

1051 TAATCTGGCT TTGGAAATTA AAAGATTTCT 

1101 TTTACATTTG TCCCATTTGG AATTTCTTTC 

1151 TACCAATTCA CTTAATTGTT GGGTAGCAAA 

1201 TGATATGTAA GTGTATACAA TTGATATAAT 

1251 CAATCTAAAT GCTATTTTTG GAAAAATAC 



TTGTT6CCAA 
TTTTGCCACT 
TGTCCCCAGA 
GAGTATGGAA 
TTTTTCAAAA 
CACGGCATAT 
ACTTTGTTTT 
GGTGGTATCA 
CCCCTTTAGC 
TACAGAATCA 
TGGTTTAAGG 
TTGTATAAAG 
TGTAGATCAT 



ATATTTCTCC 
ATCTTGAAGA 
CTTCTCTTA6 
CCACATCAT6 
ATGTCTTAGA 
GATCTCTACA 
GGAGTATCCC 
ACTGTCATCA 
ACATTAAGAA 
TAATTTATCA 
AATTCTTTTG 
CAGCTCTTGT 
ATCTAGGCTG 



Furthermore the invention relates to a method for detecting a predisposition to liver 
toxicity after administration of a pharmaceutically active compound based on one or more 
single nucleotide polymorphism(s) in the UDP-glucuronosjdtransferase (UGTl) gene 
locus in a human being, wherein additionally the polymorphism at one of the following 
positions is determinpd: 



232 in exon 1 of UGTl A6 as defined by the position 
754 in exon 1 of UGT1A6 as defined by the position 
765 in exon 1 of UGT1A6 as defined by the position 
551 in exon 1 of UGT1A7 as defined by the position 

555 in exon 1 of UGT1A7 as defined by the position 

556 in exon 1 of UGTl A7 as defined by the position 
786 in exon 1 of UGT1A7 as defined by the position 



ofSEQIDNO:2; 

inSEQIDNO:2, 

inSEQIDN0:2; 

mSEQIDN0:3; 

inSEQIDN0:3; 

inSEQID NO:3;or 

inSEQID N0:3. 



The polymorphism at position 232 in exonl of UGT1A6 of SEQ ID NO:2 consists of 
a replacement of the nucleotide T at this position by a G (which results in a Ser to Ala 
amino acid exchange at position 7 in the corresponding protein). 
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The polymorphism at position 754 of SEQ ID NO:2 consists of a replacement of the 
nucleotide A at this position by a G in exon 1 of UGT1A6 (which results in a Thr to Ala 
amino add exchange at position 181 in the corresponding protein). 

The polymorphism at position 765 of SEQ ID NO:2 consists of a replacement of the 
5 nucleotide A at this position by a C in ^on 1 of UGT1A6 (which corresponds to a Arg to 
Ser amino add exchange at position 184 of the corresponding protein). 

The polymorphism at position 551 of SEQ ID N0:3 consists of the replacement of 
the nudeotide T at this position by a G in exon 1 of the UGT1A7 gene and results in an 
amino acid exchange from Asn to Lys at position 129 in the corresponding protein. 

10 The polymorphism at position 555 of SEQ ID NO:3 consists of the replacement of 

the nucleotide C at this position by an A in exon 1 of the UGTl A7 gene. This may result in 
a silent mutation (if the nudeotide at position 556 of SEQ ID NO:3 consists of a G) or 
result in an amino add exchange from Arg to Lys at position 131 in the corresponding 
protein (if the nudeotide at position 556 of SEQ ID NO:3 consists of an A). 

15 The polymorphism at position 556 of SEQ ID N0:3 consists of the replacement of 

the nucleotide G at this position by an A in exon 1 of the UGT1A7 gene and results in an 
amino add exchange from Arg to Gin at position 131 in tfie corresponding protein in case 
the nudeotide at position 555 is a C or in an amino acid exchange from Arg to Lys at 
position 131 in case the nudeotide at position 555 is an A. 

20 The polymorphism at position 786 of SEQ ID NO:3 consists of a replacement of the 

nucleotide T at this position by a C in exon 1 of UGT1A7 (which results in a Trp to Arg 
exchange at position 208 in the corresponding protein). 

Thus, the invention rdates to a method of detecting a predisposition to liver toxidty 
after administration of a pharmaceutically active compovmd based on the determination of 

25 at least one single nudeotide polymorphism, in which the single nudeotide polymorphism 
at position 908 in exon 5 of the UGTl gene locus consists of the presence of a C or a G> the 
single nudeotide polymorphism at position 528 in exon 1 of UGT1A6 consists of the 
presence of a G or an A, the single nudeotide polymorphism at position 197 in exon 1 of 
UGTl A7 consists of the presence of a G or a C, the single nudeotide polymorphism at 

30 position 232 in exon 1 of UGT1A6 consists of tiie presence of a G or a T> the single 

nudeotide polymorphism at position 754 in exon 1 of UGT1A6 consists of the presence of 
an A or a G, the single nudeotide polymorphism at position 765 in exon 1 of UGTl A6 
consists of the presence of an A or a C, the single nudeotide polymorphism at position 551 
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in exon 1 of UGTl A7 consists of the presence of an G or a T, the single nucleotide 
polymorphism at position 555 in exon 1 of UGT1A7 consists of the presence of an A or a 
C, the single nucleotide polymorphism at position 556 in exon 1 of UGTl A7 consists of the 
presence of an A or a C> and the single nucleotide polymorphism at position 786 in exon 1 
5 of UGTl A7 consists of the presence of a C or a T. 

A number of pharmaceutically active compounds are kno^ which cause a 
hepatotoxic reaction. Examples of such compounds are nitrocatechol derivatives like 
entacapone> nitecapone or tolcapone. The main metabolic pathway for these drugs is 
glucuronidation. 

10 Glucuronidation is an important pathway of elimination of many xenobiotics 

including drugs. The UGTl enzymes are well-known to catalyze the glucuronidation of 
many ehdogeneous and exogeneous substrates, including many drugs. Pharmacokinetic 
experiments in human subjects have shown that Ihe main pathway of tolcapone 
elimmation from the body is glucuronidation (Jorga et al., Br. J. Clin. Pharmacol (1999), 

15 4, 513-20. Deficiencies in the elimination pathway of a drug can be the cause of adverse 
effects. The present invention shows a first example of genetic variations, some of which 
known from in vitro e3q)eriments to affect the glucuronidation activity of a number of 
substrates including drugs, which are significantly associated with the devdopment of 
adverse effects in human patients treated with a drug. From these results it can be 

20 concluded that the method described herein can be applied to predict the predisposition to 
adverse effects of any drug that is metabolised by UGTl enzymes. 

Drug glucuronidation by UGTs is a major phase II conjugation reaction in the 
mammahan detoxification system (Burchell et al., Life Sd. (1995), 5Z, 1819-31). 
Polymorphisms in UGTs can markedly affect binding of a substrate, which can be 

25 manifested either as a dinical syndrome (if an endogenous substrate is affected) or as a 
change of response to a drug and/or as a adverse event (if a drug is affected). Therefore it is 
important to identify genetic sequence polymorphisms in the UGTl gene in general. 
Nucleid adds comprising the polymorphic sequences can be used in screening assays, and 
for genotyping individuals. The genotyping information can be used to predict an 

30 individual's rate of metaboHsm for UGTl substrates, potential drug-drug interactions, and 
adverse/side effects as well as diseases that result from environmental or occupational 
exposure to toxins. The nuddc adds can be used to establish animal, cell and in vitro 
models for drug metabolism. All the following identified polymorphisms are amenable to 
be associated with an individual's rate of metabolism for UGTl substrates, potential drug- 



wo 02/06523 



PCT/EPOl/07524 



-10- 

drug interactions, adverse/side effects and diseases that result from environmental or 
occupational exposure to toxins. 



Table 1. SNP positions 



gene 


acc.no* 


SNP i 
1 


SNP pos. 


substit. 


AA SUbSt* 


U6T1A6 


M84130 


axon 1 


232 


T/G 


Ser7Ala 




SEQ ID N0:2 














exon 1 


318 


C/T 


silent 






axon 1 


528 


A/G 


silent 






exon 1 


754 


A/6 


ThrlSlAla 






exon 1 


765 


A/C 


Argl84Ser 






exon 1 


840 


G/T 


silent 


UGTl A7 


U39570 


5'UTR 


108 


T/G 






SBQ ID NO: 3 














exon 1 


197 


C/G 


silent 






exon 1 


551 


T/G 


Asnl29Lys 






exon 1 


555 


C/A 


silent or 












Argl31Lys 






exon 1 


556 


G/A 


Argl31Gln or Lys 






exon 1 


786 


T/C 


Trp208Arg 






exon 1 


920 


G/A 


silent 






exon 1 


824 


C/T 


silent 






exon 1 


992 


C/A 


Asn/Lys 


UGTl AlO 


U39550. 


exon 1 


959 


C/T 


0 ^ ^ ^ 




SEQ ID NO:4 










UGTl A8 


U42604 


prom 


245 


C/A 






SEQ ID NO: 5 










UGTl AS 


AF056188 


exonl 


214 


C/T 


silent 




SEQ ID NO: 6 










UGT ex2-5 


M84122 


intron 


117 


C/T 






SEQ ID NO: 7 










coininon to 




intron 


379 


C/T 




all UGTlAs 


M84123 


exon4 


473 


G/T 


Gly/Val 




SBQ ID NO: 8 












M84124 


3'UTR 


423 


T/G 






SEQ ID N0:1 














3'UTR 


780 


T/C 








3'UTR 


908 


G/C 








3'UTR 


ioi2 


C/G 





5 



The SNP positions in Table 1 always refer to the position in the sequence with the 
specified accession number in the pubUc domain and the corresponding SEQ ID NO. given 
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in this application. Primer sequences for genot/ping assays are given in the method section. 
For nucleotide substitution the nucleotide of the wildtype allele is given first> same for the 
amino acid substitutions. SEQ ID NOs 1-3 are given above, SEQ ID NOs 4-8 are following 
bdow. 

5 SEQ ID NO:4 refers to Genbank accesion number U39550, which provides exon 1 of 

UGTIAIO. 

1 CTCTCCCTCC AAGGCGAAGA CCATAATCTA CTCTTGTCTG AAATCATACA 

10 51 AGTAGGTATC TCAGCAAATG ATACTCGTGT GTTATCGTTC TTATGAGTAA 

101 ATCATTGGCA 6TGAGTGT6A TTTTTTTTTT TTTTATGAAA 6GATAAATAC 

151 ACGCCCTCTA TT6G6GTCAG GTTTTGTGCC TGTACTTCTT CCGCCTACTG 

15 

201 TATCATAGCA GCTTAGAATC CCAGCTGCTG GCTCGGGCT6 CAGTTCTCTC 

251 ATCGTCGCGC AGGGTGATGG CTCGCGCAGG GTGGACCAGC CCCGTTCCTT 

20 301 TATGTGTGT6 TCTACTGCT6 ACCTGTGGCT TT6CCGAGGC AGGGAAGCTG 

351 CTGGTAGTGC CCATGGATGG GAGTCACTGG TTCACCTITGC AGTCGGTGGT 

401 GGAGAAACTT ATCCTCAGGG GGCATGAGGT GGTTGTAGTC ATGCCAGAGG 

25 

451 TGAGTTGGCA ACTG6AAAGA TCACTGAATT GCACAGTGAA GACTTACTCA 

501 ACCTCGTACA CTCTGGAAGA TCAGAACCGG GAATTCATGG TTTTCGCCCA 

30 551 TGCTCAATGG AAAGCACAGG CACAAAGTAT ATTTTCTCTA TTAATGAGTT 

601 CATCCAGTGG TTTTCTTGAC TTATTTTTTT CGCATTGCAG GAGTTTGTTT 

651 AATGACCGAA AATTAGTAGA ATACTTAAAG GAGAGTTCTT TTGATGCAGT 

35 

701 GTTTCTGGAT CCTTTTGATA CCTGTGGCTT AATTGTTGCT AAATATTTCT 

751 CCCTCCCCTC TGTGGTCTTC ACCAGGGGAA TATTTTGCCA CCATCTTGAA 

40 801 GAAGGTGCAC AGTGCCCTGC TCCTCTTTCC TATGTCCCCA ATGATCTCTT 

851 AGGGTTCTCA GATGCCATGA CTTTCAAGGA GAGAGTATGG AACCACATCG 

^01 TGCACTTG6A GGACCATTTA TTTTGCCAGT ATCTTTTTAG AAATGCCCTA 

45 

951 GAAATAGCCT CTGAAATTCT CCAAACCCCT GTCACGGCAT ATGATCTCTA 

1001 CAGTCACACA TCAATTTGGT TGTTGCGAAC GGACTTTGTT TTGGACTATC 

50 1051 CCAAACCCGT GATGCCCAAC ATGATCTTCA TTGGTGGTAT CAACTGTCAT 

1101 CAGGGAAAGC CATTGCCTAT GGTAAGTCAC CTCTCCTTTA GCACATTAAG 

. 1151 AATAATCTGG CTTTGGAATT AAAAAAGGAT TCCTTACTGA ACTGTGATTT 

55 

1201 GACATTTCGT TGTGGCATTC AATTTCTTTC CAGTTTAACA AATTATTTTG 

1251 TGCGAATTCA TGTACTCATC AATTATCAAA TTTTATAAAA CTGCCCTTCT 

60 1301 TGA7AGTATA TGTAATAATT TAAAAATTAT AGATCATATT CAGGCTACAT 
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1351 TTTAAAATAC GATGTTTAGA AAAGTACCAA AAAACCACAG CAAGAAATGA 
1401 AACTTCCGTT TTTTTGTTAT TCTATGTGAC CCCGTAGTTG AAAATGCTCT 
1451 TA 

SEQ ID NO:5 refers to Genbank accesion number U42604, which provides exon 1 of 
UGT1A8: « 



1 GGGCATGATC TGTCCAAGGC AGAGACTATA AGCTACTCTT ATAGTACTCT 

51 TATGAGATAC ATACAAGTAG GTATCTCAAA AAATGATACT CATGTATTCC 

15 101 TGTTCTTATG AGTAAATCAT TGGCAGTGAG TGTGATTTTT TTTTTTTTTA 

151 TGACAGGATC CCTACACGCC CTCTATTGGG GTCAGGTTTT GTGCCTGTAG 

201 TTCTTCCGCC TACGTATCAT AGCAGTTAGA ATCCCAGCTG CTGGCTCGGG 

20 

251 CTGCAGTTCT CTCATGGCTC GCACAGGGTG GACCAGCCCC ATTCCCCTAT 

301 GTGTTTCTCT GCTGCTGACC TGTGGCTTTG CTGAGGCAGG GAAGCTGCTG 

25 351 GTAGTGCCCA TGGATGGGAG TCACTGGTTC ACCATGCAGT CGGTGGTGGA 

401 GAAACTTATC CTCAGGGGGC ATGAGGTGGT TGTAGTCATG CCAGAGGTGA 

451 GTTGGCAACT GGGAAAATCA CTGAATTGCA CAGTGAAGAC TTACTCAACC 

30 

501 TCATACACTC TGGAGGATCT GGACCGGGAA TTCATGGATT TCGCCGATGC 

551 TCAATGGAAA GCACAAGTAC GAAGTTTGTT TTCTCTATTT CTGAGTTCAT 

35 601 CCAATGGTTT TTTTAACTTA TTTTTTTCGC ATTGCAGGAG TTTGTTTAAT 

651 GACCGAAAAT TAGTAGAATA CTTAAAGGAG AGTTCTTTTG ATGCGGTGTT 

701 TCTTGATCCT TTTGATGCCT GTGCGTTAAT TGTTGCCAAA TATTTCTCCC 

40 

751 TCCCCTCTGT GGTCTTCGCC AGGGGAATAG GTTGCCACTA TCTTGAAGAA 

801 GGTGCACAGT GCCCTGCTCC TCTTTCCTAT GTCCCCAGAA TTCTCTTAGG 

45 851 GTTCTCAGAT GCCATGACTT TCAAGGAGAG AGTACGGAAC CACATCATGC 

901 ACTTGGAGGA ACATTTATTT TGCCAGTATT TTTCCAAAAA TGCCCTAGAA 

951 ATAGCCTCTG AAATTCTCCA AACACCTGTC ACAGCATATG ATCTCTACAG 

50 

1001 CCACACATCA ATTTGGTTGT TGCGAACAGA CTTTGTTTTG GACTATCCCA 

1051 AACCCGTGAT GCCCAATATG ATCTTCATTG GTGGTATCAA CTGCCATCAG 

55 1101 GGAAAGCCAT TGCCTATGGT AAGTCACCTC TCCTTTAGCA CATTAGGAAT 

1151 AATCTTGGCT TTGGAAATTA AAAAAAGATT CCTTACTGAA TTGTGATTTG 

1201 ACATTTTCAT TTGTTGCATT TCAAATTTCT TTCCAGTTTA CAGA 

60 



15 



25 
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SEQ ID NO:6 refers to Genbank accesion number AF056188, which provides exon 1 
ofUGTlA9: 

1 CTCAGCTGCA GTTCTCTGAT GGCTTGCACA GGGTGGACCA GCCCCCTTCC 

5 

51 TCTATGTGTG TGTCTGCTGC TGACCTGTGG CTTTGCCGAG GCAGGGAAGC 

101 TACTGGTAGT GCCCATCGAT* GGGAGCCACT GGTTCACCAT GAGGTCGGTG 

10 151 GTGGAGAAAC TCATTCTCAG GGGGCATGAG GTGGTTGTAG TCATGCCAGA 

201 GGTGAGTTGG CAACTGGGAA GATCACTGAA TTGCACAGTG AAGACTTATT 

251 CAACTTCATA TACCCTGGAG GATCTGGACC GGGAGTTCAA GGCTTTTGCC 

301 CATGCTCAAT GGAAAGCACA AGTACGAAGT ATATATTCTC TATTAATG6G 

351 TTCATACAAT GACATTTTTG ACTTATTTTT TTCAAATTGC AGGAGTTTGT 

20 401 TTAAAGACAA AAAATTAGTA GAATACTTAA AGGAGAGTTC TTTTGATGCA 

451 GTGTTTCTCG ATCCTTTTGA TAACTGTGGC TTAATTGTTG CCAAATATTT 

501 CTCCCTCCCC TCC6TG6TCT TC6CCAGGGG AATACTTTGC CACTATCTTG 

551 AAGAAGGTGC ACAGTGCCCT GCTCCTCTTT CCTATGTCCC CAGAATTCTC 

601 TTAGGGTTCT CAGATGCCAT GACTTTCAAG 6AGAGA6TAC GGAACCACAT 

30 651 CATGCACTTG 6AGGAACATT TATTATGCCA CCGTTTTTTC AAAAAT6CCC 

701 TA6AAATAGC CTCTGAAATT CTCCAAACAC CTGTTACGGA GTATGATCTC 

751 TACAGCCACA CATCAATTTG GTTGTTGCGA ACGGACTTTG TTTTGGACTA 

35 

801 TCCCAAACCC GTGATGCCCA ACATGATCTT CATTGGTG6T ATCAACTGCC 

851 ATCAGGGAAA GCCGTTGCCT ATGGAATTTG AAGCCTACAT TAATGCTTCT 

40 901 GGAGAACATG GAATTGT6GT TTTCTCTTTG GGATCAATGG TCTCA6AAAT 

951 TCCAGAGAAG AAAGCTATGG CAATQ?GCTGA TGCTTTGG6C AAAATCCCTC 

1001 AGACAGTCCT 6TGGCGGTAC ACTGGAACCC GACCATCGAA TCTTGCGAAC 

1051 AACACGATAC TTGTTAAGT6 GCTACCCCAA AAC6ATCTGC TTGGTCACCC 

1101 GATGACCCGT GCCTTTATCA CCCATGCTGG TTCCCATGGT GTTTATGAAA 

50 1151 GCATATGCAA TGGC6TTCCC ATGGTGATGA TGCCCTTGTT TGGTGATCAG 

1201 ATGGACAATG CAAAGCGCAT GGAGACTAAG GGAGCTG6AG TGACCCTGAA 

1251 TGTTCTGGAA ATGACTTCTG AA6ATTTAGA AAATGCTCTA AAAGCAGTCA 

55 

1301 TCAATGACAA AAGTTACAAG GAGAACATCA TGCGCCTCTC CAGCCTTCAC 

1351 AA6GACCGCC CGGTGGAGCC GCTGGACCTG GCCGTGTTCT GG6TGGAGTT 

60 1401 T6TGATGAGG CACAAGGGCG CGCCACACCT GCGCCCCGCA GCCCACGACC 

1451 TCACCTGGTA CCAGTACCAT TCCTTGGACG TGATTGGTTT CCTCTTGGCC 

1501 GTCGTGCTGA CAGTGGCCTT CATCACCTTT AAATGTTGTG CTTATGGCTA 

65 



45 
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1551 CCGGAAATGC TTGGGGAAAA AAGGGCGAGT TAA6AAAGCC CACAAATCCA 

1601 AGACCCATTG AGAAGTGGGT GGGAAATAAG GTAAAATTTT GAACCATTCC 

5 1651 CTAGTCATTT CCAAACTTGA AAACAGAATC AGTGTTAAAT TCATTTTATT 

1701 CTTATTAAGG AAATACTTT6 CATAAATTAA TCAGCCCCAG A6TGCTTTAA 

1751 AAAATTCTCT TAAATAAAAA TAATAGACTC GCTAGTCAGT AAAGATATTT 
10 I 

1801 GAATATGTAT CGTGCCCCCT CCGGTGTCTT TGATCAG6AT GACATGTGCC 

1851 ATTTTTCA6A GGAC6TGCAG ACAGGCTGGC ATTCTAGATT ACTTTTCTTA 

15 1901 CTCTGAAACA TG6CCTGTTT GGGAGTGCGG GATTCAAAG6 TGGTCCCACC 

1951 GCTGCCCCTA CTGCAAATGG CAGTTTTAAT CTTATCTTTT GGCTTCTGCA 

2001 GATGGTTGCA ATTGATCCTT AACCAATAAT GGTCA6TCCT CATCTCTGTC 

20 

2051 CTGCTTCATA GGTGCCACCT T6TGTGTTTA AAGAAGGGAA GCTTTGTACC 

2101 TTTA6AGTGT AGGTGAAATG AATGAATGGC TTGGAGTGCA CTGAGAACAG 

25 2151 CATATGATTT CTTGCTTTGG GGAAAAA6AA TGATGCTATG AAATTGGTGG 

2201 GTGGTGTATT TGAGAAGATA ATCATTGCTT AT6TCAAATG GAGCTGAATT 

2251 a?GATAAAAAC CCAAAATACA GCTATGAAGT GCTGGGCAAG TTTACTTTTT 

30 

2301 TTCTGATGTT TCCTACAACT 

SEQ ID NO:7 refers to Genbank accesion number M84122, which provides the 
mtronofUGTlA: 

35 

1 TTTGCATCTC AAGGATAATT CTGTAAGCAG GAACCCTTCC TCCTTTAGAA 

51 GGAAGTAAAG GAGAGGAAAA TGCTGTAAAA CTTACATATT AATAATTTTT 

40 101 TACTCTATCT CAAACACGCA TGCCTTTAAT CATAGTCTTA AGAG6AAGAT 

151 ATCTAATTCA TAACTTACTG TATGTAGTCA TCAAA6AATA TGAGAAAAAA 

201 TTAACTGAAA ATTTTTCTTC TGGCTCTAGG AATTTGAAGC CTACATTAAT 

45 

251 GCTTCTGGAG AAGATG6AAT TGTGGTTTTC TCTTTGG6AT CAATGGTCTC 

301 AGAAATTCCA GAGAAGAAAG CTATGGCAAT TGCTGATGCT TTGQGCAAAA 

50 351 TCCCTCAGAC A6TAAGAAGA TTCTATACCA TGGCCTCATA TCTATTTTCA 

401 CAGGAGCGCT AATCCCAGAC TTCCAGCTTC CAGATTAATT CTCTTAATTG 

451 GAACCTTAGA TTTGGCTTTT CCCTGCCACT TCCCAACTAT TAATCCAAAG 

501 GTTTTTTTT6 TT 

SEQ ID NO:8 refers to Genbank accesion number M84123, which provides exon 4 
UGTIA: 



55 



60 



1 AAAGATGTCC TCAAGGGACC CTGTTTTCTA 6TTAGTATAG CAGATTTGTT 
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5 



15 



25 



51 


TTCTAATCAT 


ATTATGTCTT TCTTTACGTT 


CTGCTCTTTT 


GCCCCTCCCA 


101 


GGTCCTGTGG 


CGGTACACTG GAACCCGACC 


ATCGAATCTT 


GCGAACAACA 


151 


CGATACTTGT 


TAAGTGGCTA CCCCAAAACG 


ATCTGCTTGG 


TATGTTGGGC 


201 


GGATTGGATG 


TATAGGTCAA ACCAGGGTCA 


AATTAAGAAA 


ATGGCTTAAG 


251 


CACAGCTATT 


CTAAAGGATT 1 GTTGAGCTTG 


AAAATATTAT 


GGCCAACATA 


301 


TCCTACATTG 


CTTTTTATCT AGTGGGGTAT 


CTCAACCGAC 


ATTTTCTTCT 


351 


GCAAATTTCT 


GCAAGGGCAT GTGAGTAACA 


CTGAGTCTTT 


GGAGTGTTTT 


401 


CAGAACCTAG 


ATGT6TCCAG CTGTGAAACT 


CAGAGATGTA 


ACTGCTGACA 


451 


TCCTCCCTAT 


TTTGCATCTC AGGTCACCCG 


AT6ACCCGTG 


CCTTTATCAC 


501 


CCATGCTGGT 


TCCCATGGTG TTTATGAAAG 


CATATGCAAT GGCGTTCCCA 


551 


TGGTGATGAT GCCCTTGTTT GGTGATCAGA 


TGGACAATGC 


AAAGCGCATG 


601 


GAGACTAAGG 


6AGCTGGAGT GACCCTGAAT 


GTTCTGGAAA T6ACTTCTGA 


651 


AGATTTAGAA AATGCTCTAA AAGCAGTCAT 


CAAT6ACAAA AGGTAAGAAA 


701 


GAAGATACAG 


AA6AATACTT TGGTCATGGC 


ATTCATGATA AAATTGTTTC 


751 


AAATAT6AAA ACATTTACGT AGCATTTAAT 


ACGT 





The method in accordance with the present invention can be performed using any 
suitable method for detecting single nucleotide variations, such as e.g. allele specific 
ampUfication (i.e. ARMS™-allele specific amplification; ARMS referring to amplification 
35 refiractory mutation system), allele specific hybridisation (ASH), oligonucleotide hgation 
assay (OLA) and restriction firagment length polymorphism (RFLP). 

The status of a human being may be determined by reference to allelic variation at 
position 908 in exon 5 as defined by the position in SEQ ID NO:l and, if necessary, at one 
or more additional positions displaying a polymorphism. 

40 The test sample of the nucleic acid carrying the said polymorphism is conveniently a 

sample of blood, bronchoalveolar lavage fluid, sputum, urine or other body fluid or tissue 
obtained firom an individual. It will be appreciated that the test sample may equally be a 
nucleic acid sequence corresponding to the sequence in the test sample, that is to say that 
all or a part of the region in the sample nucleic acid may firstly be amplified using any 

45 convenient technique, e.g. polymerase chain reaction (PGR) or ligase chain reaction (LCRX 
before analysis of allelic variation. 

It win be apparent to the person skilled in the art that there are a large number of 
analytical procedures which may be used to detect the presence or absence of variant 



wo 02/06523 



PCT/EPOl/07524 



-16- 

nucleotides at one or more polymorphic positions of the invention. In general, tiie 
detection of allelic variation requires a mutation discrimination technique, optionally an 
amplification reaction and optionally a signal generation system. International patent 
application WO 00/06768 lists a number of amplification techniques and mutation 

5 detection techniques, some based on PGR. These may be used in combination with a 
number of signal generation systems, a selection of which is also listed in WO 00/06768. 
Many current methods for the detection of allelic variation are reviewed by Nollau et aL, 
Clin. Chem. 43, 1114-1120, 1997; and in standard textbooks, for example "Laboratory 
Protocols for Mutation Detection", EA by U. Landegren, Oxford University Press, 1996 

10 and "PGR", 2nd Edition by Newton 8c Graham, BIOS Scientific Publishers Limited, 1997, 

The invention also relates to diagnostic nucleic adds comprising within their 
sequence the polymorphism at position 908 of exon 5 of UGTl (SEQ ID N0:1), the 
polymorphism at position 754 in exon 1 (SEQ ID NO:2) or the polymorphism at position 
765 of exon !• 

15 The term "diagnostic nucleic acid" refers to a nucleotide sequence of at least 17 

nucleotides in length which corresponds to part or all of the human UGTl gene. The 
diagnostic nucleic acid is preferably a part of the human UGTl gene which part expresses 
the polymorphism. A lengtii of 17 to 100 nucleotides is preferred. 

Furthermore the invention relates to allde specific primers which can be used as 
20 diagnostic primers for detecting a polymorphism in the UGTl gene capable of hybridizing 
to nucleic adds comprising witiiin their sequence the polymorphisms as defined above. 

An allele specific primer is used, generally together with a constant primer, in an 
ainplification reaction such as a PGR reaction, which provides the discrimination between 
alleles through selective amplification of one allde at a particular sequence position e.g. as 
25 used for ARMS™ assays. The length of the allde specific primer is preferably 17- 50 
nucleotides, more preferably about 17-35 nudeotides, most preferably about 17-30^ 
nucleotides. 

Preferably the allde specific primer corresponds exactly with the allde to be detected 
but derivatives thereof are also contemplated wherein about 6-8 of the nucleotides at the 3* 
30 terminus correspond with the allele to be detected and wherein up to 10, such as up to 8, 6, 
4, 2, or 1 of the remaining nudeotides may be varied without significantiy affecting the 
properties of the primer. Often the nudeotide at the -2 and/or -3 position (relative to the 3' 
terminus) is mismatched in order to optimize differential primer binding and preferential 
extension firom the correct allele discriminatory primer only. 
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Suitable examples of such diagnostic allele specific primers are the foDowing: 



UGT1A6T181S: 
UGT1A6T181S: 
UGT1A6R184S: 
UGT1A6 R184S: 



CGTGTTCCCTGGAGGATA (SEQ ID NO:24) 

CGTGTTCCCTGGAGCATG (SEQ ID NO:25) 

GACACAGGGTCTGGGCTT (SEQ ID NO:27) 

GACACAGGGTCTGGGCTG ' (SEQ ID NO:28) 

UGTlA-3' 908-2: TGCAGTAGGGGCAGCG (SEQ ID NO:30) 

UGTlA-3' 908-2 TGCAGTAGGGGCAGCG (SEQ ID N0:31>. 

Any convenient method of synthesis may be used to manu&cture primers. Examples 
of such methods may be found in standard textbooks, for example '^Protocols for 
10 Oligonucleotides and Analogues; Synthesis and Properties," Methods in Molecular Biology 
Series; Volume 20; Ed. Sudhir Agrawal, Humana ISBN: 0-89603-247-7; 1993; 1st Edition. If 
required primers may be labelled to facilitate detection. 

Furthermore the invention relates to allde-^edfic oligonucleotide probes for 
detecting a polymorphism in the UGTI gene capable of hybridizing to diagnostic nudeic 
15 adds comprising within their sequence the polymorphisms as defined above. 

The length of the allele-spedfic oligonucleotide probes are preferably 17- 50 
nudeotides, more preferably about 17-35 nudeotides, most preferably about 17-30 
nudeotides. 

The design of such probes will be apparent to the person skilled in the art In general 
20 such probes will comprise base sequences entirely complementary to the corresponding 
wild type or variant locus in the gene. However, if required one or more mismatches may 
be introduced, provided that the discriminatory power of the oligonucleotide probe is not 
imduly affected. The probes of the invention may carry one or more labels to facilitate 
detection. 

25 Furthermore the invention rdates to diagnostic kits comprising one or more allde- 

spedfic oligonucleotide primers or aHde-spedfic oligonucleotide probes for detecting a 
polymorphism in the UGTI gene. 

The diagnostic kits may comprise appropriate packaging and instructions for use in 
the methods of the invention. Such kits may further comprise one or more appropriate 
30 buffers and one or more polymerases such as thermostable polymerases, for example Taq 
polymerase. Such kits may also comprise companion/constant primers and/or control 
primers or probes. A companion/constant primer is one that is part of the pair of primers 
used to perform PGR. Such primer usually complements the template strand precisdy. 
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Furthermore the invention relates to a pharmaceutical pack comprising a 
pharmaceutically active compound like Tolcapone and instructions for administration of 
the drug to human beings diagnostically tested for a single nucleotide polymorphism 
according to a method of the present invention. 

5 Furthermore the invention relate^ to a computer readable medium having stored 

thereon a sequence information for the polymorphism at position 908 of exon 5 of UGTl. 

This invention further relates to a method for performing sequence identification, 
which methods comprise the steps of providing a nucleic add sequence carrying e. g. the 
polymorphic site of position 90& of exon 5 or a complementary strand thereof or a 
10 fragment thereof of at least 20 bases; and comparing said nucleic add sequence to at least 
one oAer nudeic add or polypeptide sequence to identify identity. 

The invention is further illustrated by the following figures: 

Figure 1 shows the primary metaboHc routes of tolcapone in the Hver. Tolcapone is 
oxidized by cytochrome P450 3A4 (CYP3A4), the nitro group is reduced and acetylated by 
15 N-acetyltransferase (SAT). The phenolic hydroxy group can be sulfated by sulfotransferase 
(ST) or meth^ated by catechol-O-methyl transferase (COMT). Glucuronidation of the 
hydroxy group , a major reaction of detoxification in the Uver, is catalyzed by UDP- 
glucuronosyltransferase (UGT). Subsequent oxidation or conjugation with glucuronate, 
sulphate and acetate further modifies primary metabolites. 

20 Figure 2 represents the UGTIA gene structure. The UGTIA gene spans more than 

500kb, and consists of at least 12 promoters and first exons which can be spliced with the 
common exons to result in 12 diflFerent UGTIA enzymes. The structure of the UGT1A6 
transcript is shown bdow. The arrows indicate the rdative position of the polymorphic 
markers used in this study. UGT-3*_908 represents the polymorphism at position 908 in 

25 exon 5 as defined by the position in SEQ ID NO:l. This polymorphism in the 3'UTR 
(untranslated region) can potentially affect the expression of all nine functional UGTIA 
enzymes. The other two polymorphisms in exon 1A6 affect the protein structure of 
UGT1A6. 



wo 02/06523 



PCT/EPOl/07524 



-19- 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by the person skilled in the art to which this invention 
belongs. 

The following examples are provided for illustration of the invention, but are not 
5 intended to limit the scope of what is regarded as the invention. 

Examples 

Analysis of tolcapone induced livertoxicity 
Selection of patients 

The study protocol and the informed consent form were submitted for approval to 
10 the local ethical committees in the respective countries. AD patients provided written 

informed consent for their blood sample to be used for genotypying. The consent could be 
withdrawn up to a month later, if the patients changed their mind. 

All the samples were assigned new independent codes and a month after sample 
collection the link between the new and original codes was deleted. This was an added 
15 measure to ensure patient confidentiality, however, as a consequence it is not possible to 
retrieve genotype information based on the patient's name or number used in the original 
dinical trial. In approximately 15 years time, aU blood and DNA samples will be destroyed. 

Initially, 645 patients who had received tolcapone in previous clinical trials were 
considered for inclusion in this retrospective genetic analysis. This included 215 patients 

20 who had displayed liver enzyme levels of >1.5x the upper limit of normal (ULN) and 430 
patients who had normal liver enzyme levels. Each patient with elevated liver trans- 
aminases (ELT) was matched with at least two control patients firom the same study for 
gender and age (ethnic matching was not necessary since the great majority of the target 
patients were Caucasians). Disease severity had already been controlled for by the original 

25 study inclusion criteria. Of the 215 ELT patients, 135 ELT patients were enrolled in the 
study, 31 patients did not participate because their respective investigators could not 
obtain ethical approval to conduct the study, and 49 patients were either lost to follow-up 
or deceased. In total, 409 patients participated in this pharmacogenetic analysis. The 
distribution of the patients across the different sites and countries is shown in Table 2. 
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Table 2: patient distribution 

Country No of Sites NoofELT No of Controls Total 
patients patients 



Austria 


2 


2 


7 


9 


Australia 


1 


2» 


4 


6 


Canada 


14 


31 


74 


106 


Switzerland 


5 


4 


10 


14 


Germany 


9 


11 


29 


40 


Denmark 


4 


8 


13 


21 


Spain 


4 


4 


6 


10 


UK 


4 


8 


10 


18 


Italy 


5 


12 


23 


35 


Norway 


2 


5 


4 


9 


USA 


20 


48 


94 


142 


Total 


70 


135 


274 


410 



Preparation of the samples 

Single blood samples (9ml) were collected in EDTA tubes. These were frozen and 
5 stored between -20 and -70'C, before being sent to the Roche Central Sample Office (CSO) 
in Basel, Switzerland, where Aey were aliquoted into three tubes and assigned new, 
independent codes on bar code labels to assure patient anonymity. Two samples of blood 
(1ml and 4mls) were sent to the Roche Sample Repository (RSR) at Roche Molecular 
Systems (RMS) in Alameda, California. The remaining 4ml aliquot was stored at -80 ''C in 
10 the CSO in Basel, Switzerland. AD procedures performed on the samples at the RSR were 
done according to established standard operating procedures using GCP guidelines. 

DNA was extracted from 400 (il of the whole blood using a sflica gel-based extraction 
method (QiaAmp DNA Blood kit, Valencia, CA). Controls included 10 mM Tris pH 8.0, 1 
mM EDTA (TE) buffer and whole blood from a blood unit with a known yield of DNA. 

15 Samples were genotyped for eight different single nucleotide polymorphisms (SNPs) 

using a combination of the amplification refractory mutation system (ARMS) that relies on 
3' terminal mismatches between the PGR primers and the template being amplified 
according to Nevyton et aL, Nuddc Acids Res. (1989), 17(7), 2503-16. 

Analysis of any point mutation in DNA was transformed by using the amplification 
20 refractory mutation system (ARMS, Nucleic Acids Res. (1989), 17(7), 2503-16) and using 
the kinetic thermal cycler (KTC) format of the polymerase chain reaction. This method 
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allows discrimination of single nucleotide polymorphisms (SNP) in a single-tube without 
the use of fluorescent probes (Higuchi et al., Biotedmology (1993), 11, 1026-1030). 

In the KTC format, the generation of double-stxanded amplification product is 
monitored using a DNA intercalating dye and a thermal cycler which has a fluorescence- 
5 detecting CCD camera attached (PE-Bipsystems GeneAmp 5700 Sequence Detection 
System). Fluorescence in each well of the PCR amplification plate is measured at each cycle 
of annealing and denaturation. The cyde at which the relative fluorescence reached a 
threshold of 0.5 using the SDS software from PE-Biosystems was defined as the Ct. 

The amplification reactions were designed to be allde-specific, so that the 
10 amplification reaction was positive if the polymorphism was present and the amplification 
reaction was. negative if the polymorphism was absent For each bi-allelic polymorphism, 
one weD of tfie amplification plate was set up to be specific for allele 1 and a second wdl 
was set up to be specific for allele 2. For.eadi polymorphism to be detected, three pruners 
were designed - two allde-spedfic primers and one common primer (Table 3). Reactions 
15 for allde 1 contained allde 1-specific primer and the common primer and reactions for 
allde 2 contained allde 2-specific primer and the common primer. 



Table 3: list of oligonudeotide primers used for polymorphism detection 





Primer 


Nucleotide sequence 


SEODD 


Primer 


Annealing 




tm 




NO 


concentration temperature 










(^n^M) 




COMTV158M 


ASl 


GCACACCrTGTCCrrCAT 


9 


0.4 


58 


COMTV158M 


AS2 


GCACAccrrGTccrrcAC 


10 


0.4 


58 


COMTV158M 


common 


CATCACCATCGAGATCAAC 


11 


0.4 


58 


CYP3A4A/G 


ASl 


CTATTAAATCGCCTCTCTCT 


12 


0.4 


56 


CYP3A4A/G 


AS2 


CTATTAAATCGCCTCTCrCC 


13 


0.4 


56 


CYP3A4A/G 


common 


GGATGAATTTCAAGTATTT 


14 


0.4 


56 


MbS0DV-9A 


ASl 


AGCCCAGATACCCCAAAG 


15 


0.4 


58 


MnSODV-9A 


AS2 


AGCCCAGATACCCCAAAA 


16 


0.4 


58 


MnS0DV-9A 


common 


TGTGCTTTCTCGTCTrCA 


17 


0.4 


58 


NAT2I114T 


ASl 


TCTAATTCCTGCCGTCAG 


18 


0.2 


58 


NAT2I114T 


AS2 


TGTAATTCCTGCCGTCAA 


19 


0.2 


58 


NAT2I114T 


common 


ATACAGCACTGGCATGG 


20 


0.2 


58 


SULTlAl R213H 


ASl 


CCTGGAGTTTGTGGGGCG 


21 


0.2 


58 


SULTlAl R2i3H 


AS2 


CCTGGAGTTTGTGGGGCA 


22 


0.2 


58 


SULTlAl R213H 


common 


TGAACCATGAAGTCCACG 


23 


0.2 


58 


UGT1A6T181S 


ASl 


CGTGTrCCCTGGAGCATA 


24 


0.2 


58 


UGT1A6T181S 


AS2 


CGTGTTCCCTGGAGCATG 


25 


0.2 


58 
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UGTIA6T181S 


common 


GAATGTAGGACACAGGGTCT 


26 


0.2 


58 


UGT1A6R184S 


ASl 


GACACAGGGTCTGGGCTT 


27 


0.2 


58 


UGT1A6R184S 


AS2 


GACACAGGGTCTGGGCTG 


28 


0.2 


58 


UGT1A6R184S 


common 


TACCTCTTCAGGGG'lTnC 


29 


0.2 


58 


UGTlA-3' 908-2 


ASl 


TGCAGTAGGGGCAGCG 


30 


0.2 


58 


UGTlA-3' 908-2 


AS2 


TGCAGTAGGpGCAGCC 


31 


0.2 


58 


UGTlA-y 908-2 


common 


GGAGTGCGGGATTCAA 


32 


0.2 


58 



The amplification conditions were as follows: 10 mM Tris pH 8.0, 40 mM KCl, 2 mM 
MgCl2, 50 Jim each of dATP> dCTP, and dGTP, 25 fim of TTP and 75 Jim of dUTP, 4% 
DMSO, 0.2X SyBr Green (Molecular Probes, Eugene, OR), 2% glycerol, uracil N- 
5 glycosylase (UNG, 2 units), Stoffel Gold DNA polymerase ( 1 5 units, for reference see 
Nature (1996), 381 > 445-6) and primers in an 85 volume for each well. The concen- 
tration of the primers used for each assay are listed in Table 2. 30 ng of DNA in a 15 jil 
volume was then added to each weU. 

To reduce the possibility of contamination by pre-existing amplification product, the 
10 assay procedure included the incorporation of dUTP into the amplification product and an 
incubation step for UNG degradation of pre-existing U-containing products (Longo et al, 
Gene (1990), 23.125-128). 

Amplification reactions were prepared using an aliquoting robot (Packard 
Multiprobe II, Meriden, CT) in 96-weD amplification plates identified by barcode labels 
15 generated by the experiment management database^ Parameters for procedures performed 
by the robot were set to minimize the possibility of cross-contamination. For each plate of 
81 samples, 5 samples were run in duplicate and the duplicate results were analysed to 
detennine that they matched. 

The tibermal cyding conditions were as foDows: 5 minutes at 50 X for UNG 
20 degradation of any previously contaminating PGR products, 12 minutes at 95 X for Stoflfd 
Gold polymerase activation, 55 cycles of denaturation at 95 X and anneaHng at the 
annealing temperature indicated in Table 2, followed by a dissociation step of 1 minute at 1 

degree increments firom 60 ""C to 95 ''C. The amplification reactions were run in PE 
Biosystems GeneAmp 5700 Sequence Detection Systems (SDS) instruments (Foster City, 
25 CA). The first derivatives of the dissociation curves were produced by the SDS software and 
examined as needed to confirm that the fluorescence in a given reaction was due to 
amplification of a specific product with a well-defined dissociation peak rather than non- 
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specific primer-dimer. Product dififerentiation was done by Analysis of DNA Melting 
Curves during PGR following the method of ICM. Ririe et al.. Anal. Biodiem. (1997), 245. 
154-160. 

The Ct of each amplification reaction was determined and the difference between the 
5 Q for aUde 1 and allele 2 (delta Q) was^used as the assay result. Samples with delta Qs 
between -3.0 and 3.0 were considered heterozygous (A1/A2). Samples with delta CtS below 
-3.0 were considered homozygous for Al (Al/Al); samples with delta Qs above 3.0 were 
considered homozygous for A2 (A2/A2). In most cases> the delta Q diflferences between the 
three groups of genotypes were well-defined and samples with Q values dose to 3.0 were 
10 re-tested as discrepants. 

Each assay was run on a panel of 14 cell line DNAs to identify cell lines with the 
appropriate genotypes for use as controls on each assay plate (Al/Al, A1/A2, and A2/A2). 
The cdl line DNA was obtained from the Human Genetics Department, Roche Molecular 
Systems (KMS) Alameda, CA and was ejctracted using the Qiagen extraction kits (QiaAmp 

15 DNA Blood kits, Vdenda, GA). The genotypes of the cell line DNAs were confirmed by 
DNA sequencing. Three cell line DNAs (Al/Al, A1/A2, and A2/A2) were run as controls 
on each plate of clinical trial samples and used to determine the between-plate variability. 
In addition, DNA fi:om two cell lines were run in quadruplicate for each assay to determine 
the within-plate assay variabOity. The Ct values obtained for the control cell lines were 

20 analyzed to determine the cutoff for the ddta Q values obtained for the clinical trial 
samples. 

A data file containing the Q values for each well was generated by the SDS software 
and entered into the experiment management database. A data file with the final genotypes . 
identified by the independent code was extracted from the database and matched to the 
25 clinical data also identified by the independent code for the statistical analysis. 

For all other single nudeotide polymorphisms (SNPs) discovery and genotyping was 
done by double-stranded DNA sequencing using an ABI capillary sequencer and Big Dye 
chemistry (ABI>. The primers used to amplify all exons are shown bdow and were also 
used as sequencing primers. Publidy available genomic sequences were used as references 
30 for primer design. All polymorphisms were targeted with these pairs-of-primer sets: 

UGT1A6-1 fragment UGT1A6-F1 ACACGGCCATACTTGGTTCA (SEQ II> NO:33) 

UGT1A6-R1 CAGTTGATGAAGTACAGGCC (SEQIDNO:34) 



wo 02/06523 



PCT/EPOl/07524 



-24- 

UGT1A6-2 fragment UGT1A6-F2 TGTAGTGGTGGTGCXTGAAG (SEQ ID NO:35) 



UGT1A6-R2 



GACAGCTGATGCGAGTTCTTC (SEQ ID NO:36) 



UGTIA7-1 fragment: UGriA7-Fl GAGGGCAGGTTCTATCGTAC (SEQIDNO:37) 

UGT1A7-R1 I GGGCACTGTGCACCTTCTTC (SEQIDNO:38) 
5 UGT1A7-2 fragment UGTIA7-F2 ACGGCACCATTGCGAAGTGC (SEQIDNO:39) 



UGTIA8 fragment 



UGTl A9 fragment 



10 



UGT1A7-R2 

UGTIAS-F 

UGTIA8-R 

UGT1A9-F 

UGT1A9.R 



ACTTACATATCAACAAGTGCTGC (SEQIDNO:40) 



GGGCATGATCTGTCCAAGGC (SEQ ID NO:41> 



GGTTGAGTAAGTCTTCACTGTG (SEQ ID NO:42) 



CrCAGCrGCAGTTCTCrG 



(SEQIDNO:43) 



CCAGATCXrrCCAGGGTATATG (SEQ ID NO:44) 



UGTlAlOfr^ent UGTlAlO-F GAGTTCATCCAGTGGTTTTC (SEQIDNO:45) 



UGTlin fragment 



15 UGTlex4 fra^ent 



UGTIAIO-R 

UGTlAin-F 

UGTlAin-R 

UGTlex4-F 

UGTle3c4-R 



CAGTTCAGTAAGGAATCC 



(SEQIDNO:46) 



CAAGGATAATTCTGTAAGCAGG (SEQ ID NO:47) 



GGATTAATAGTTGGGAAGTGGC (SEQ ID NO:48) 



GGCCAACATATCCTACATTG (SEQIDNO:49) 



CGTATrAAATGCTACGTAAATGT (SEQIDNO:50) 



20 



UGTI-CJC5-1 fragment UGTlexS-l-F CAGTTAGCCATGCTTGTGCG (SEQIDNO-31) 

UGTlex5-I-R GCACTCTGGGGCTGATTAAT (SEQIDNO:52> 

UGTl-e3c5-2 fragment UGTlex5-2-F CGTGCTGACAGTGGCCTTC (SEQIDNO:53> 

UGTlaS-2-R CAGTGCACTCCAAGCCATrC (SEQIDNO:54) 



UGTl-e3c5-3 fragment UGTlex5-3-F GATGGTTGCAATTGATCC 



(SEQIDN055) 



UGTlex5-3-R TTAGTTGTAGGAAACATCAG (SEQIDNO:56) 
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Primer UGT1A6-F1 corresponds to positions 3 to 22 in exon 1 of UGT1A6 as 
deiBned by the positions in SEQ ID N0:2. Primer UGT1A6-R1 corresponds to the 
complementary strand and hybridizes to positions 571 to 590 as defined by the positions in 
SEQ ID NO:2. Primer UGT1A6-F2 refers to positions 381 to 400 in exon 1 of UGT1A6 as 
5 defined by the positions in SEQ ID N0:2. Primer UGTl A6-R2 corresponds to the 

complementary strand and hybridizes to positions 901 to 921 as defined by the positions in 
SEQ ID NO:2. 

Primer UGT1A7-F1 corresponds to positions 78 to 98 in exon 1 of UGT1A7 as 
defined by the positions in SEQ ID NO:3. Primer UGT1A7-R1 corresponds to the 
10 complementary strand and hybridizes to positions 696 to 715 as defined by the positions in 
SEQ ID NO:3. Primer UGT1A7-F2 corresponds to positions 459 to 478 in exon 1 of 
UGT1A7 as defined by the positions in SEQ ID NO:3. Primer UGT1A7-R2 corresponds to 
the complementary strand and hybridizes to positions 1190 to 1212 as defined by the 
positions in SEQ ID NO:3. 

15 Primer UGT1A8-F corresponds to positions 1 to 20 in exon 1 of UGT1A8 as defined 

by tiie positions in SEQ ID N0:5. Primer UGT1A8-R hybridizes to positions 479 to 500 as 
defined by the positions in SEQ ID N0:5. 

Primer UGT1A9-F corresponds to positions 1 to 18 in exon 1 of UGT1A9 as defined 
by the positions in SEQ ID NO:6. Primer UGT1A9-R hybridizes to positions 257 to 277 as 
20 defined by the positions in SEQ ID NO:6. 

Primer UGTIAIO-F corresponds to positions 596 to 615 in exon 1 of UGT1A7 as 
defined by the positions in SEQ ID N0:4. Primer UGTIAIO-R hybridizes to positions 1177 
to 1 194 as defined by the positions in SEQ ID N0:4. 

Primer UGTlAin-F corresponds to positions 10 to 31 in the intron of UGTIA as 
25 defined by the positions in SEQ ID NO:7. Primer UGTlAin-R hybridizes to positions 475 
to 496 as defined by the positions in SEQ ID NO:7. 

Primer UGTlex4-F corresponds to positions 291 to 310 in exon 4 of UGTIA as 
defined by the positions in SEQ ID N0:8. Primer UGTlex4-R hybridizes to positions 761 
to 784 as defined by the positions in SEQ ID NO:8. 

30 Primer UGTlex5-l-F corresponds to positions 63 to 82 in exon 5 of UGTl as 

defined by the positions in SEQ ID NO:L Primer UGTlex5-l-R hybridizes to positions 
684 to 703 as defined by the positions in SEQ ID NO:L Primer UGTlex5-2-F corresponds 
to positions 461 to 480 in exon 5 of UGTl as defined by the positions in SEQ ID NO:L 
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Primer UGTlex5-2-R hybridizes to positions 1082 to 1 101 as defined by the positions in 
SEQ ID NO:L Primer UGTlex5-3-F corresponds to positions 959 to 976 in exon 5 of 
UGTl as defined by the positions in SEQ ID NO:l and primer UGTlex5-3-R hybridizes to 
positions 1261 to 1280 as defined by the positions in SEQ ID NO:L 

5 Fourty nanograms of genomic dUa were PCR-ampKfied in 50 jil reactions using an 

automated PGR machine. Reaction conditions varied as follows. For the amplification of 
the UGTlA6-fi:agment conditions were as follows: 10 mM Tris pH 8.3, 50 mM KCl, 
L5 mM MgClz, 0.2 mM of each dNTP, 0.4 \M of each primer and 1.5U Boehringer Taq 
Polymerase. The thermocycUng protocol consisted of an initial incubation of 95 °C for 

10 15 min. followed by 35 cycles of 94 for 1 min., 57 ^'C for 30 sec, 72 for 1 min., and 
one final eictension step of 72 °C for 10 min. The UGT1A7-1 firagment was amplified using 
Qiagen PGR buffer with 1.5 mM Mga2 , 0.2 mM of each dNTP, 0.4 (iM of each primer and 
1.5U Boehringer Taq Polymerase. The thermocyding protocol was Ae same as for 
UGTl A6-fi:agment with one exception: the annealing temperature was 61 **C. For 

15 UGT1A7-2 firagment PGR conditions were as follows: 150 mM Tris pH 8.5, 15 mM 
(NH4)2S04, 3.5 mM MgGl2, 0.2 mM of each dNTP, 0.4 vM of each primer and 1.5U 
Qiagen Hot Start Taq Polymerase. Thermocyding was done using a touch-down PGR 
protocol. After an iniatial amplification of 95 °C for 10 min. followed 5 cydes of 95 "^C for 
lmin.> 62 ^'C for 30 sec. (minus 0.5 ^'G per cycle), 72 ^G for 1 min. and thirty cydes of 95 **G 

20 for 1 min., 60 ""C for 30 sec, 72 ^'C for 1 min. and a final extension step of 72 **G for 10 min. 
After PGR amplification firagments were purified using the Qiaquick PGR purification kit 
on a Biorobot 9600. Gyde sequendng was performed on an automated PGR machine using 
ABI Big Dye terminator chemistry according to the manufacturer's intruction with the 
following changes: 2.5-5 ng/100 bp of PGR product were mixed with 2 jbll Big Dye 

25 terminatior mix, oligonudeotide primer concentration was 10 pmol, if necessary 5% 
DMSO was added to the reaction; the final reaction volume was 10 pi. Sequencing 
reactions were subjected to 28 cydes at 93 ""C for 30 sec, 48 ""C for 30 sec, and 58 °G for 
120 sec, followed by an ethanol/NaOAc predpitation. After decanting the ethanol, samples 
were evaporated to dryness using a SpeedVac for 2 min. and were resuspended in 45 lA 

30 ultrapure water (MERGK, HPLG grade). 2.5 ml were loaded on an ABI 3700 capillary 
sequencer using POP5 as a polymer. After sequendng, the polymorphism analyses were 
done using Polyphred software (licenced firom University of Washington). 

Selection and discovery of genetic markers 

The genetic markers were sdected based on the known pharmacology of tolcapone 
35 and knowledge firom the literature of genetic polymorphisms that could affect the activity 
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of corresponding and relevant gene products. The main metabolic pathway for tolcapone 
ehmination is glucuronidation by UGTl enTymes. 

In addition to polymorphisms in the UGTl enzyme, genetic polymorphisms in genes 
encoding the following enzymes involved in tolcapone metabolism (Figure 1) were 

5 selected: catechol-O-methyl transferase^ (COMT) according to Lachman et al.. 

Pharmacogenetics ( 1996), 6, 243-250; N-acetyl transferase (NAT2, for reference see Vatsis 
et al., Proc. Natl. Acad. Sd. (1991), SS, 6333-6337; Bandmann et al., Lancet (1997), 350, 
1136-1139); liver sulfotransferase (SULTlAl) according to Ozawa et al., Qiem. Biol. 
Interact (1998), Ifig, 237-248 and Cytochrome P450 enzyme (CYP3A4, Rebbedk et al., J. 

10 Natl. Cancer Inst (1998), 5Q, 1225- 1229). The genotype for manganese superoxide 
dismutase (MnSOD, Shimoda-Matsubayashi et al., Biochem. Biophys. Res. Commun. 
(1996), 226. 561-565), involved in oxidative stress response, was also investigated. 

UDP-glucuronosyltransferase 1A6 (UGT1A6) was sdected as it potentially 
metabolises tolcapone via glucuronidation. The alleles ThrlSlAla and Argl84Ser are 

15 described as showing reduced activity for levodopa and other substrates (Ciptti et al.. 
Pharmacogenetics (1997), 2> 485-495) .The known genetic polymorphisms in the UGTIA 
gene affect only single members of this gene duster of twelve genes (Figure 2). Therefore, it 
was reasoned that genetic variations in the potentially common regulatory region, that is 
the 3*-end of the gene, could have an effect on the expression of any of the twdve UGTIA 

20 genes. Moreover, it was suspected that UGT1A7 may be involved in the dimination of 
tolcapone. In order to identify new genetic polymorphisms common exons 2-5 and the 3* 
untranslated region of UGTIA and exon 1 of the UGT1A6, UGT1A7, UGT1A8, UGTIA^ 
and UGTIAIO genes were sequenced in 47 different DNA samples from ethnically diverse 
individuals. The 300 -700 bp fragments were column purified with the Qiaquick PCR 

25 purification kit on a Biorobot 9600 and both strands were sejquenced on an ABI3700 

capillary sequencer using dye-terminator chemistry and the PCR amplification primers as 
sequencing primers as described in detail above. A G/C variation was identified designated 
as UGT1A-3L908, which occurred in the followmg frequendes: CC: 0.63; GC:0.33; 
GG:0.04. The nmnber 908 refers to the position of the SNP relative to the DNA sequence 

30 with Genbank accession number M84124 from the public data bases. Moreover, the 

following polymorphisms have been identified in UGTl A6, UGT1A7, UGT1A8, UGT1A9 
and UGTIAIO genes: 

UGTlA6exonl_3 18 and UGTlA6exonl-528. The number refers to the position of 
the SNP relative to the DNA sequence with Genbank accession number M84130 from the 
35 public database. 
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UGT1A7 exonL197> UGTlA7exonl^824, UGTlA7exonl^920, and 
UGTlA7exonl_992. The numbers refer to the position of the SNP relative to the DNA 
sequence with Genbank accession number U39570 from the public database. 

UGTlA8promoter_245. The number refers to the position of the SNP relative to the 
5 DNA sequence with Genbank accessioi| number U42604 from the public database. 

UGTlA9exonl_214. The number refers to the position of the SNP relative to the 
DNA sequence with Genbank accession number AF056188 from the public database- 

UGTlA10exonl359. The number refers to the position of the SNP relative to tiie 
DNA sequence with Genbank accession number U39550 from the pubUc database. 

10 UGTl Aintron^l 17 and UGTlAintron_379. The numbers refer to the position of the 

SNPs relative to the DNA sequence with Genbank accession number M84122 from the 
pubUc database. 

UGTlAexon4_473. The number refers to the position of the SNP relative to the DNA 
sequence with Genbank accession number M84123 from the pubHc database. 

15 UGTlAexon5_423, UGTlAexon5_780, UGTlAexon5_908 (described above in 

detail) and UGTlAexon5_I012. The numbers refer to the position of the SNPs relative to 
the DNA sequence with Genbank accession number M84124 from the public domain. 

The patient samples were divided into two groups. Group 1 contained samples from 
case patients whose aspartate aminotransferase (AST:SGOT), alanine aminotransferse 
20 (ALT: SGPT), or bilirubin values were >1.5x ULN of the investigators range while taking 
tolcapone treatment Group 2 contained samples from control patients whose SGOT, 
SGPT, and bilirubin values were below Ix ULN vfhen measured while taking tolcapone 
treatment 

For each genotype, the following analyses were performed: 

25 a) Analysis of the entire genotype: patients were classified according to the foUowing three 
categories: homozygous 1/1 (two copies of allele 1 and no copies of allele 2), heterozj^ous 
1/2 (one copy of allele 1 and one copy of allele 2) or homozygous 2/2 (no copies of allde 1 
and two copies of allele 2). Analysis was conducted to Bssess whether there were differences 
in the proportion of case patients in each of these groups, compared with the proportion of 

30 control patients. The Cochran-Maentel-Hanszel (CMH) test was applied to the data 
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presented in a 2-by-3 table (the two columns indicating presence or absence of liver 
function abnormality and the three rows indicating the three categories of the genotype). 

b) Analysis of alleles: the presence of allele 1 or 2 in case patients was compared to the 
presence of the respective allele in the control patients. For each allele, the CMH test was 

5 applied using a 2-by-2 table (the two ccjlimms indicating presence or absence of liver 

function abnormality and the two rows indicating the presence or absence of the respective 
allele)> and the case-control odds ratio and 95% confidence inten^l were calculated. An 
odds-ratio of greater than 1.0, together with a confidence interval that does not indude 1.0, 
indicated a positive association between the presence of the allele and the occurrence of 

10 liver function abnormality. 

c) Analysis of allele counts: this analysis was conducted to compare the distribution of the 
alldes in patients with liver function abnormality to that of patients without abnormality. 
The total number of copies of allele 1 among patients with liver function abnormality was 
compared with the total number of copies of allele 2 among these case patients. The CMH 

15 test was applied to the data using a 2-by-2 table (the two colunms indicating presence or 
absence of liver function abnormality and the two rows indicating the two aUele counts). 
Again case-control odds ratio and 95% confidence intervals were obtained. 

Results 

A total of 409 patients treated with tolcapone, of which 135 had liver enzyme 
20 elevation of 1 .5 times or more above the upper limit and 274 were matched controls, were 
genotyped for different genetic markers from genes encoding enzymes involved in the 
metabolism of tolcapone, including the UGTl genes. The results from the analysis of the 
genetic markers that resulted in a significant association are presented in tables 4 to 13. All 
markers showing significant association to elevated liver transaminases corresponded to 
25 SNPs in the UGTl genes. 
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Table 4. UGT1A6_765 Argl84Ser 



Iiiver Function Abnormality 
Absent Present 



Genotype 



Allele Arg 



Allele Ser 



Count 



Homozygous Arg/Arg 128 ( 46.7%) 

Heterozygous' Arg/Ser 122 ( 44.5%) 

Homozygous Ser/Ser 24 ( 8.8%) 
P-value 



45 
64 
26 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 

Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 

Allele Arg 
Allele Ser 
Odds Ratio 
95% CI 
P-value 



0.0023 



■24 ( 8.8%) 26 
250 ( 91.2%) 109 
0.58 
0.40 
( 0.22, 0.72) 
0.0023 

128 { 46.7%) 45 
146 { 53.3%) 90 
1.47 
1-75 
( 1.14, 2.69) 
0.0101 

378 ( 69.0%) 154 
170 ( 31.0%) 116 
1.67 
( 1.24, 2.26) 
0.0008 



{ 33.3%) 
( 47.4%) 
( 19.3%) 



( 19.3%) 
( 80.7%) 



{ 33.3%) 
{ 66.7%) 



( 57.0%) 
( 43.0%) 
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Table 5. UGT1A6_754 ThrlSlAla 



Liver Function Abnormality 
Absent Present 



Genotype 



Allele Thr 



Allele Ala 



Count 



Homozygous Thr /Thr 144 ( 52.6%) 

Heterozygous ¥hr/Ala 108 ( 39.4%) 

Homozygous Ala/Ala 22 ( 8.0%) 
P~value 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 

Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 

Allele Thr 
Allele Ala 
Odds Ratio 
95% CI 
P-value 



22 
252 



144 
130 



( 8.0%) 
( 92.0%) 



( 0 



( 52.6%) 
( 47.4%) 



( 1 



0.0014 



51 { 37.8%) 
59 ( 43.7%) 
25 ( 18.5%) 



25 ( 18.5%) 
110 ( 81.5%) 



0.57 

0.38 
.21, 0.70> 
0.0018 

51 
84 

1.50 

1.82 
.20, 2.77) 
0.0050 



396 ( 72.3%) 161 
152 { 27.7%) 109 
1.76 
{ 1.30, 2.39) 
0.0003 



( 37.8%) 
( 62.2%) 



( 59.6%) 
( 40.4%) 
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Liver Function Abnormality 



Absent Present 



Genotype 


Homn y VCT oil q C / C! 


163 


( 


59 


.5%) 


101 


i 


74 


• 8%) 




Heterozygous oj^C 


97 


( 


35 


.4%) 


30 




22 


.2%) 




Homozygous 6/G 


14 


( 


5 


-1%) 


4 


( 


3 


.0%) 




*r V dJ. uc 










n 00Q7 








Allele C 




14 


( 


5 




4 


/ 
\ 


•a 


. 0%) 






260 


I 




• 7^ / 




1 


Q7 


























Odds Ratio 










1.76 










95% CI 








( 0. 


58, 5.4a) 










P-value 










0.3202 








Allele 6 


Absent 


163 


( 


59 


.5%) 


101 


( 


74 


.8%) 




Present 


111 


( 


40 


.5%) 


34 


( 


25 


.2%) 




Relative Risk 










0.61 










Odds Ratio 










0.49 










95% CI 








( 0. 


31, 0.78) 










P-value 










0.0023 








Count 


Allele C 


423 


( 


77 


.2%) 


232 


( 


85 


.9%) 




Allele G 


125 


( 


22 


.8%) 


38 


( 


14 


.1%) 




Odds Ratio 










0.55 










95% CI 








( 0. 


37, 0.82) 










P -value 










0.0033 
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Table 7. UGT1A6.232 Ser7AIa 



Liver Function Abnormality 



Absent 



Present 



Genotype 



Homozygous T/T 

Heterozygous tVg 
Homozygous G/G 
P-value 



112 ( 42,1%) 

123 ( 46.2%) 
31 { 11.7%) 



40 ( 30.5%) 

63 { 48.1%) 
28 ( 21.4%) 



0.0130 



Allele T 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 



31 ( 11.7%) 
235 ( 88.3%) 

0.64 
0.49 
( 0.28, 0.84) 
0.0106 



28 ( 21.4%) 
X03 ( 78.6%) 



Allele 6 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 



112 ( 42.1%) 
154 ( 57.9%) 



( 1 



40 
91 

1.41 

1.65 
06, 2.58) 
0.0259 



{ 30.5%) 
( 69.5%) 



Count 



Allele T 
Allele G 
Odds Ratio 

95% CI 
P-value 



347 ( 65.2%) 
185 ( 34.8%) 

1.56 
( 1.16, 2.11) 
0.0037 



143 ( 54.6%) 
119 { 45.4%) 
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Liver Function Abnormality 



Absent Present 



Genotype 



Homozygous A/A 129 ( 48,7%) 

Heterozygous A/G 114 ( 43.0%) 

Homozygous G/G 22 ( 8.3%) 
P-value 



44 
60 
26 



( 33.8%) 
( 46.2%) 
( 20.0%) 



0.0008 



Allele A 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 



22 ( 8.3%) 
243 ( 91.7%) 



26 
104 



0.55 
0.36 
0.20, 0.66} 
0.008 



( 20.0%) 
( 80.0%) 



Allele G 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P -value 



129 ( 48.7%) 44 
136 ( 51.3%) 86 
1.52 
1.85 
( 1.20, 2.68) 
0.0053 



( 33.8%) 
( 66.2%) 



Count 



Allele A 
Allele G 
Odds Ratio 
95% CI 
P-value 



372 ( 70.2%) 148 
158 { 29.8%) 112 
1.78 
( 0.31, 2.42) 
0.0002 



( 56.9%) 
( 43.1%) 
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Table 9: UGT1A7^197 OG 

Liver Function Abnormality 

Absent Present 



Genotype 



Homozygous A/A 32 ( 11.9%) 

Heterozygous a)c 112 { 41.6%) 

Homozygous C/C 125 ( 46.5%) 
P-value 



25 { 18.8%) 
62 ( 46.6%) 
46 ( 34.6%) 



0.0400 



Allele A 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 



125 
144 



( 46.5%) 
( 53.5%) 



( 1 



46 
87 

1.40 

1.64 
07, 2.52) 
0.0235 



( 34.6%) 
( 65.4%) 



Allele C 



Count 



Absent 
Present 
Relative Risk 
Odds Ratio 
95% CI 
P-value 

Allele A 
Allele G 
Odds Ratio 
95% CI 
P-value 



32 { 11.9%) 25 ( 18.8%) 

237 { 88.1%) 108 ( 81.2%) 

0.71 
0.58 
( 0.33, 1.03) 
0.0623 

176 ( 32.7%) 112 { 42.1%) 

362 { 67.3%) 154 ( 57.9) 

0.67 
( 0.49, 0.90) 
0.0090 
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Table 10: UGT1A7_551 Asnl29Lys 

Liver Pimction Abnormality 

Absent Present 



Genotype 


Homozygous G/G 


96 


( 


35 


.7%) 


59 


( 


44.7%) 




Heterozygous g)t 


124 


{ 


46 


.1%) 


59 


( 


44,7%) 




Homozygous T/T 


49 


( 


18 


.2%) 


14 


( 


10.6%) 




P-value 










0.0762 






Allele G 


Absent 


49 


( 


18 


.2%> 


14 


( 


10.6%) 




Present 


220 


{ 


81 


»8%) 


118 


{ 


89.4%) 




Relative Risk 










1.57 








Odds Ratio 










1.88 








95% CI 








( 1 


.00, 3.52) 








P-value 










0.0494 






Allele T 


Absent 


96 


( 


35 


.7%) 


59 


( 


44.7%) 




Present 


173 


( 


64 


.3%) 


7 


( 


55.3%) 




Relative Risk 










0.78 








Odds Ratio 










0.69 








95% CI 








( 0 


.45, 1.05) 








P-value 










0.0821 






Count 


Allele G 


116 


( 


58 


.7%) 


177 


( 


67.0%) 




Allele T 


222 


( 


41 


.3%) 


87 


{ 


33.0%) 




Odds Ratio 










0.70 








95% CI 








( 0 


.51, 0.95) 








P-value 










0.0232 
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Table ll:UGTlA7_555saentor Argl31Lys 

Liver Function Abnormality 

Absent Present 



Genotype 


Homozygous A/ A 


96 


( 


35 


.7%) 


58 


{ 


43.9%) 




Heterozygous A*/C 


124 


( 


46 


.1%) 


60 


( 


45.5%) 




Homozygous 6/6 


49 


( 


18 


.2%) 


14 


( 


10.6%) 




" P -value 










0.0894 






Allele A 


Absent 


49 


( 


18 


.2%) 


14 


( 


10 .6%) 




Present 


220 


( 


81 


.8%) 


118 


( 


89.4%) 




Relative Risk 










1.57 








Odds Ratio 










1.88 








95% CI 








( 1 


.00, 3.52) 








P-value 










0.0494 






Allele C 


Absent 


96 


( 


35 


.7%) 


58 


( 


43.9%) 




Present 


173 


{ 


64 


.3%) 


74 


{ 


56.1%) 




Relative Risk 










0.80 








Odds Ratio 










0.71 








95% CI 








( 0 


.46, 1.08) 








P-value 










0.1108 






Count 


Allele A 


316 


{ 


58 


.7%) 


176 


{ 


66.7%) 




Allele C 


222 


{ 


41 


.3%) 


88 


{ 


33.3%) 




Odds Ratio 










0.71 








95% CI 








( 0 


.52, 0.97) 








P-value 










0.0303 
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Table 12: UGT1A7_556 Argl31Lys or Gin 



Liver Fxanction Abnormality 
Absent Present 









/ 
1 


• Oi> f 


Ad 


/ 
\ 




• / 




Heterozygous A^G 


123 


( 


45.9%) 


59 


{ 


44 


.7%) 




Homozygous G/G 


49 


( 


18.3%) 


14 


{ 


10 


.6%) 


























/ 
I 


1 ft '^9t\ 


J.** 


I 


1 n 

-LU 


. O T> / 








/ 
I 




lift 

J.XD 


/ 
\ 








Relative Risk 








1.58 










Odds Ratio 








1.89 










95% CI 






( 1. 


01, 3.53) 










P -value 








0.0477 








Allele G 


Absent 


96 


( 


35.8%) 


59 


( 


44 


.7%) 




Present 


172 


( 


64.2%) 


73 


( 


55 


.3%) 




Relative Risk 








0.78 










Odds Ratio 








0.69 










95% CI 






( 0. 


45, 1.06) 










P-value 








0.0870 








Count 


Allele A 


315 


( 


58.8%) 


177 


( 


67 


.0%) 




Allele G 


221 


( 


41.2%) 


87 


{ 


33 


.0%) 




Odds Ratio 








0.70 










95% CI 






( 0. 


5i, 0.95) 










P-value 








0.0238 









5 
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Table 13: UGT1A7_786 Trp208Arg 



liiver Fiinction Abnormality 



Absent 



Present 



Genotype 



Allele T 



Allele G 



Count 



Homozygous T/T 


124 


( 


46 


.3%) 


46 


( 


34.3%) 


Heterozygous T/C 


114 


( 


42 


.5%) 


. 62 


( 


46.3%) 


Homozygous C/C 


30 


( 


11 


.2%) 


26 


( 


19.4%) 


P-value 










0.0224 






Absent 


30 


( 


11 


.2%) 


26 


( 


19.4%) 


Present 


238 


( 


88 


.8%) 


108 


( 


80.6%) 


Relative Risk 










0.67 






Odds Ratio 










0.52 






95% CI 








( 0 


.30, 0.92} 






P-value 










0.0252 






Absent 


124 


( 


46 


.3%) 


46 


( 


34.3%) 


Present 


144 


{ 


53 


.7%) 


86 


( 


65'. 7%) 


Relative Risk 










1.40 






Odds Ratio 










1.65 






95% CI 








( 1.07, 2.53) 






P-value 










0.0225 






Allele T 


362 


( 


67 


.5%) 


154 


( 


57.5%) 


Allele C . 


174 


( 


32 


,5%) 


114 


( 


42.5%) 


Odds Ratio 










1.54 






95% CI 








( 0 


.14, 2.08) 






P-value 










0,0050 
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The main metabolic pathway for tolcapone elimination is glucuronidation. The 
results from the current retrospective analysis have shown a significant association between 
three genetic polymorphisms in the UDP-glucuronosyltransferase gene and liver function 
abnormahty. These findings support the hypothesis that impaired elimination of tolcapone 

5. may be a cause for liver toxicity. In vitrq studies in rat hepatotocyte cultures have shown 
that inhibition of glucuronidation and oxidation increase cytotoxicity of tolcapone. 
Moreover, the UGT1A6 Alal81/Serl84 variant was shown to have reduced activity in vitro 
compared with the Thrl81/Argl84 variant (Qotti et aL, Pharmacogenetics 1997, 7, 485- 
495). This concurs with the findings firom the current analysis whereby the presence of 

10 AlalSl and the absence of Serl84 were associated with an incrementally higher risk of liver 
abnormality. The polymorphism located in the 3*UTR of the UGTIA gene (Figure 2) may 
affect the expression of all UGTlA genes involved in metaboUsm of Tolcapone. 
Alternatively, the polymorphism may be in linkage disequilibrium with another mutation 
that affects either the structure of the UGTIA proteins, or the expression of the gene. 

15 No significant association was found with the other markers tested. These results do 

not rule out the potential contribution of other polymorphisms within the gene tested. 

The relatively low odd ratios that result from these associations, are due to the 
multifactorial nature of drug induced Uver toxicity. It was clear throughout this study that 
the occurrence of liver enzyme elevation upon treatment vnth tolcapone was the result of 
20 multiple &ctors including external influences such as co-medication, and or the 
combination of several genetic fectors in diflferent individuals. 
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SEQUENCE LISTING 

<110> F- Hoffmann- La Roche AG 

5 <120> Method for detecting a pre-disposition to drug induced 
hepatotoxicity 

<130> 20707 ELMS 

I 

10 <140> 
<141> 

<150> EP00115353.5 
<151> 2000-07-14 

15 

<160> 56 

<170> Patent In Ver. 2.1 

20 <210> 1 
<211> 1561 
<212> DNA 
<213> Homo sapiens 

25 <300> 

<308> GENBANK M84124 

<400> 1 

taattccagc tactctggag gctgaggcag gaggatggct tgagcccagg agttggaggc 60 

30 tgcagttagc catgcttgtg ccactacact ccagcccggg caacagggca agactctgta 120 
tctaaaaaca acaacaacaa caataataga aacaggtttc ctttcccaag tttggaaaat 180 
ctggtagtct tcttaagcag ccatgagcat aaagagagga ttgttcatac cacaggtgtt 240 
ccaggcataa cgaaactgtc tttgtgttta gttacaagga gaacatcatg cgcctctcca 300 
gccttcacaa ggaccgcccg gtggagccgc tggacctggc cgtgttctgg gtggagtttg 360 

35 tgatgaggca caagggcgcg ccacacctgc gccccgcagc ccacgacctc acctggtacc 420 
agtaccattc cttggacgtg attggtttcc tcttggccgt cgtgctgaca gtggccttca 480 
tcacctttaa atgttgtgct tatggctacc ggaaatgctt ggggaaaaaa gggcgagtta 540 
agaaagccca caaatccaag acccattgag aagtgggtgg gaaataaggt aaaattttga 600 
accattccct agtcatttcc aaacttgaaa acagaatcag tgttaaattc attttattct 660 

40 tattaaggaa atactttgca taaattaatc agccccagag tgctttaaaa aattctctta 720 
aataaaaata atagactcgc tagtcagtaa agatatttga atatgtatcg tgccccctct 780 
ggtgtctttg atcaggatga catgtgccat ttttcagagg acgtgcagac aggctggcat 840 
tctagattac ttttcttact ctgaaacatg gcctgtttgg gagtgcggga ttcaaaggtg 900 
gtcccacggc tgcccctact gcaaatggca gttttaatct tatcttttgg cttctgcaga 960 

45 tggttgcaat tgatccttaa ccaataatgg tcagtcctca tctctgtcgt gcttcatagg 1020 
tgccaccttg tgtgtttaaa gaagggaagc tttgtacctt tagagtgtag gtgaaatgaa 1080 
tgaatggctt ggagtgcact gagaacagca tatgatttct tgctttgggg aaaaagaatg 1140 
atgctatgaa attggtgggt ggtgtatttg agaagataat cattgcttat gtcaaatgga 1200 
gctgaatttg ataaaaaccc aaaatacagc tatgaagtgc tgggcaagtt tacttttttt 1260 

50 ctgatgtttc ctacaactaa aaataaatta ataaatttat ataaattcta tttaagtgtt 1320 
ttcactggtg tcgcatttat ttcttgttaa gttgcatttt ctaattacaa aagtaatgca 1380 
tgattatgac agaaagtttg gaaaatatag aggttcacac acacacgcct tcattgcgtg 1440 
tgcatgcata aatgcatgag aaaagaaaaa taaccagtaa tcacatcgcc cagaaataac 1500 
cccagttaca attgtggcaa atacacatac ttataaatat tgcagatata ttaagtatac 1560^ 

55 c 1561 



<210> 2 
<211> 1220 
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<212> DNA 

<213> Homo sapiens 
<300> 

5 <308> GENBANK M84130 
<400> 2 

tgacacggcc atagttggtt catattaacc atgtgattaa aatggttaaa tattaatttg 60 
ggttcttaca tatcaaaggg taaaattcafer agcaagggag aggtagacag gacctgtgaa 120 

10 aagcagtggt tagtttaggg aaaataccta ggagccctgt gatttggaga gtgaaaactc 180 
tttattaccg ttgttacttt aactctttcc aggatggcct gcctccttcg ctcatttcag 240 
agaatttctg caggggtttt cttcttagca ctttggggca tggttgtagg tgacaagctg 300 
ctggtggtcc ctcaggacgg aagccactgg cttagtatga aggatatagt tgaggttctc 360 
agtgaccggg gtcatgagat tgtagtggtg gtgcctgaag ttaatttgct tttgaaagaa 420 

15 tccaaatact acacaagaaa aatctatcca gtgccgtatg accaagaaga gctgaagaac 480 
cgttaccaat catttggaaa caatcacttt gctgagcgat cattcctaac tgctcctcag 540 
acagagtaca ggaataacat gattgttatt ggcctgtact tcatcaactg ccagagcctc 600 
ctgcaggaca gggacaccct gaacttcttt aaggagagca agtttgatgc tcttttcaca 660 
gacccagcct taccctgtgg ggtgatcctg gctgagtatt tgggcctacc atctgtgtac 720 

20 ctcttcaggg gttttccgtg ttccctggag catacattca gcagaagccc agaccctgtg 780 
tcctacattc ccaggtgcta cacaaagttt tcagaccaca tgactttttc ccaacgagtg 840 
gccaacttcc ttgttaattt gfctggagccc tatctatttt attgtctgtt ttcaaagtat 900 
gaagaactcg catcagctgt cctcaagaga gatgtggata taatcacctt atatcagaag 960 
gtctctgttt ggctgttaag atatgacttt gtgcttgaat atcctaggcc ggtcatgccc 1020 

25 aacatggtct tcattggagg tatcaactgt aagaagagga aagacttgtc tcaggttggt 1080 
gggtttattt cttttggact gccttgtttc ttccaggctc tgtcctccct cactcatttg 1140 
gctccttgag ccgactgtcc cttggaggat ttcctggaga acggtggggg gaagtgatac 1200 
ccggctcgga gcagcgggaa 1220 

30 

<210> 3 

<211> 1279 

<212> DNA 

<213> Homo sapiens 

35 

<300> 

<308> GENBANK U39570 
<400> 3 

40 tgtattatta tgagtaaatc attggcagtg aatgtgaatt tttttttaaa tgaatgaata 60 
agtacacgcc ttcttttgag ggcaggttct atctgtactt cttccactta ctatattata 120 
ggagcttaga atcccagctg ctggctctgg gctgaagttc tctgatggct cgtgcagggt 180 
ggactggcct ccttccccta tatgtgtgtc tactgctgac ctgtgctttg ccaaggtcag 240 
ggaagctgct ggtagtgccc atggatggga gccactggtt caccatgcag tcggtggtgg 300 

45 agaaactcat cctcaggggg catgaggtgg tcgtagtcat gccagaggtg agttggcaac 360 
tgggaagatc actgaattgc acagtgaaga cttactcaac ctcatacact ctggaggatc 420 
aggaccggga gttcatggtt tttgccgatg ctcgctggac ggcaccattg cgaagtgcat 480 
tttctctatt aacaagttca tccaatggta tttttgactt atttttttca aattgcagga 540 
gtttgtttaa tgaccgaaaa ttagtagaat acttaaagga gagttgtttt gatgcagtgt 600 

50 ttctcgatcc ttttgatcgc tgtggcttaa ttgttgccaa atatttctcc ctcccctctg 660 
tggtcttcgc caggggaata ttttgccact atcttgaaga aggtgcacag tgccctgctc 720 
ctctttccta tgtccccaga cttctcttag ggttctcaga cgccatgact ttcaaggaga 780 
gagtatggaa ccacatcatg cacttggagg aacatttatt ttgcccctat tttttcaaaa 840 
atgtcttaga aatagcctct gaaattctcc aaacccctgt cacggcatat gatctctaca 900 

55 gccacacatc aatttggttg ttgcgaactg actttgtttt ggagtatccc aaacccgtga 960 
tgcccaatat gatcttcatt ggtggtatca actgtcatca gggaaagcca gtgcctatgg 1020 
taagttatct cccctttagc acattaagaa taatctggct ttggaaatta aaagatttct 1080 
tacagaatca taatttatca tttacatttg tcccatttgg aatttctttc tggtttaagg 1140 
aattcttttg taccaattca cttaattgtt gggtagcaaa ttgtataaag cagctcttgt 1200 
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tgatatgtaa gtgtatacaa ttgatataat tgtagatcat atctaggctg caatctaaat 1260 
gctatttttg gaaaaatac 1279 



5 <210> 4 
<211> 1452 
<212> DNA 
<213> Homo sapiens 

I 

10 <300> 

<308> 6ENBANK U39550 

<400> 4 

ctctccctcc aaggcgaaga ccataatcta ctcttgtctg aaatcataca agtaggtatc 60 

15 tcagcaaatg atactcgtgt gttatcgttc ttatgagtaa atcattggca gtgagtgtga 120 
tttttttttt ttttatgaaa ggataaatac acgccctcta ttggggtcag gttttgtgcc 180 
tgtacttctt ccgcctactg tatcatagca gcttagaatc ccagctgctg gctcgggctg 240 
cagttctctc atcgtcgcgc agggtgatgg ctcgcgcagg gtggaccagc cccgttcctt 300 
tatgtgtgtg tctactgctg acctgtggct ttgccgaggc agggaagctg ctggtagtgc 360 

20 ccatggatgg gagtcactgg ttcaccatgc agtcggtggt ggagaaactt atcctcaggg 420 
ggcatgaggt ggttgtagtc atgccagagg tgagttggca actggaaaga tcactgaatt 480 
gcacagtgaa gacttactca acctcgtaca ctctggaaga tcagaaccgg gaattcatgg 540 
ttttcgccca tgctcaatgg aaagcacagg cacaaagtat attttctcta ttaatgagtt 600 
catccagtgg ttttcttgac ttattttttt cgcattgcag gagtttgttt aatgaccgaa 660 

25 aattagtaga atacttaaag gagagttctt ttgatgcagt gtttctggat ccttttgata 720 
cctgtggctt aattgttgct aaatatttct ccctcccctc tgtggtcttc accaggggaa 780 
tattttgcca ccatcttgaa gaaggtgcac agtgccctgc tcctctttcc tatgtcccca 840 
atgatctctt agggttctca gatgccatga ctttcaagga gagagtatgg aaccacatcg 900 
tgcacttgga ggaccattta ttttgccagt atctttttag aaatgcccta gaaatagcct 960 

30 ctgaaattct ccaaacccct gtcacggcat atgatctcta cagtcacaca tcaatttggt 1020 
tgttgcgaac ggactttgtt ttggactatc ccaaacccgt gatgcccaac atgatcttca 1080 
ttggtggtat caactgtcat cagggaaagc cattgcctat ggtaagtcac ctctccttta 1140 
gcacattaag aataatctgg ctttggaatt aaaaaaggat tccttactga actgtgattt 1200 
gacatttcgt tgtggcattc aatttctttc cagtttaaca aattattttg tgcgaattca 1260 

35 tgtactcatc aattatcaaa ttttataaaa ctgcccttct tgaaagtata tgtaataatt 1320 
taaaaattat agatcatatt caggctacat tttaaaatac gatgtttaga aaagtaccaa 1380 
aaaaccacag caagaaatga ad:cttccgtt tttt.tgttat tctatgtgac cccgtagttg 1440 
aaaatgctct ta 1452 

40 

<210> 5 

<211> 1244 

<212> DNA 

<213> Homo sapiens 

45 

<300> 

<308> GENBAIJK U42604 
<400> 5 

50 gggcatgatc tgtccaaggc agagactata agctactctt atagtactct tatgagatac 60 
atacaagtag gtatctcaaa aaatgatact catgtattcc tgttcttatg agtaaatcat 120 
tggcagtgag tgtgattttt ttttttttta tgacaggatc cctacacgcc ctctattggg 180 
♦ gtcaggtttt gtgcctgtag ttcttccgcc tacgtatcat agcagttaga atcccagctg 240 
ctggctcggg ctgcagttct ctcatggctc gcacagggtg gaccagcccc attcccctat 300 

55 gtgtttctct gctgctgacc tgtggctttg ctgaggcagg gaagctgctg gtagtgccca 360 
tggatgggag tcactggttc accatgcagt cggtggtgga gaaacttatc ctcagggggc 420 
atgaggtggt tgtagtcatg ccagaggtga gttggcaact gggaaaatca ctgaattgca 480 
cagtgaagac ttactcaacc tcatacactc tggaggatct ggaccgggaa ttcatggatt 540 
tcgccgatgc tcaatggaaa gcacaagtac gaagtttgtt ttctctattt ctgagttcat 600 
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ccaatggttt ttttaactta tttttttcgc 
tagtagaata cttaaaggag agttcttttg 
gtgcgttaat tgttgccaaa tatttctccc 
gttgccacta tcttgaagaa ggtgcacagt 

5 ttctcttagg gttctcagat gccatgactt 
acttggagga acatttattt tgccagtatt 
aaattctcca aacacctgtc acagcatatg 
tgcgaacaga ctttgttttg gactatccca 
gtggtatcaa ctgccatcag ggaaagccab 

10 cattaggaat aatcttggct ttggaaatta 
acattttcat ttgttgcatt tcaaatttct 



-44- 

attgcaggag tttgtttaat gaccgaaaat 660 
atgcggtgtt tcttgatcct tttgatgcct 720 
tcccctctgt ggtcttcgcc aggggaatag 780 
gccctgctcc tctttcctat gtccccagaa 840 
tcaaggagag agtacggaac cacatcatgc 900 
tttccaaaaa tgccctagaa atagcctctg 960 
atctctacag ccacacatca atttggttgt 1020 
aacccgtgat gcccaatatg atcttcattg 1080 
tgcctatggt aagtcacctc tcctttagca 1140 
aaaaaagatt ccttactgaa ttgtgatttg 1200 
ttccagttta caga 1244 



<210> 6 
15 <211> 2320 
<212>.DNA 
<213> Homo sapiens 

<300> 

20 <308> 6ENBANK AF056188 
<400> 6 

ctcagctgca gttctctgat ggcttgcaca gggtggacca gcccccttcc tctatgtgtg 60 
tgtctgctgc tgacctgtgg ctttgccgag gcagggaagc tactggtagt gcccatggat 120 

25 gggagccact ggttcaccat gaggtcggtg gtggagaaac tcattctcag ggggcatgag 180 
gtggttgtag tcatgccaga ggtgagttgg caactgggaa gatcactgaa ttgcacagtg 240 
aagacttatt caacttcata taccctggag gatctggacc gggagttcaa ggcttttgcc 300 
catgctcaat ggaaagcaca agtacgaagt atatattctc tattaatggg ttcatacaat 360 
gacatttttg.acttattttt ttcaaattgc aggagtttgt ttaaagacaa aaaattagta 420 

30 gaatacttaa aggagagttc ttttgatgca gtgtttctcg atccttttga taactgtggc 480 
ttaattgttg ccaaatattt ctccctcccc tccgtggtct tcgccagggg aatactttgc 540 
cactatcttg aagaaggtgc acagtgccct gctcctcttt cctatgtccc cagaattctc 600 
ttagggttct cagatgccat gactttcaag gagagagtac ggaaccacat catgcacttg 660 
gaggaacatt tattatgcca ccgttttttc aaaaatgccc tagaaatagc ctctgaaatt 720 

35 ctccaaacac ctgttacgga gtatgatctc tacagccaca catcaatttg gttgttgcga 780 
acggactttg ttttggacta tcccaaaccc gtgatgccca acatgatctt cattggtggt 840 
atcaactgcc atcagggaaa gccgttgcct atggaatttg aagcctacat taatgcttct 900 
ggagaacatg gaattgtggt tttctctttg ggatcaatgg tctcagaaat tccagagaag 960 
aaagctatgg caattgctga tgctttgggc aaaatccctc agacagtcct gtggcggtac 1020 

40 actggaaccc gaccatcgaa tcttgcgaac aacacgatac ttgttaagtg gctaccccaa 1080. 
aacgatctgc ttggtcaccc gatgacccgt gcctttatca cccatgctgg ttcccatggt 1140 
gtttatgaaa gcatatgcaa tggcgttccc atggtgatga tgcccttgtt tggtgatcag 1200 
atggacaatg caaagcgcat ggagactaag ggagctggag tgaccctgaa tgttctggaa 1260 
atgacttctg aagatttaga aaatgctcta aaagcagtca tcaatgacaa aagttacaag 1320 

45 gagaacatca tgcgcctctc cagccttcac aaggaccgcc cggtggagcc gctggacctg 1380 
gccgtgttct gggtggagtt tgtgatgagg cacaagggcg cgccacacct gcgccccgca 1440 
gcccacgacc tcacctggta ccagtaccat tccttggacg tgattggttt cctcttggcc 1500 
gtcgtgctga cagtggcctt catcaccttt aaatgttgtg cttatggcta ccggaaatgc 1560 
ttggggaaaa aagggcgagt taagaaagcc cacaaatcca agacccattg agaagtgggt 1620 

50 gggaaataag gtaaaatttt gaaccattcc ctagtcattt ccaaacttga aaacagaatc 1680 
agtgttaaat tcattttatt cttattaagg aaatactttg cataaattaa tcagccccag 1740 
agtgctttaa aaaattctct taaataaaaa taatagactc gctagtcagt aaagatattt 1800 
gaatatgtat cgtgccccct ccggtgtctt tgatcaggat gacatgtgcc atttttcaga 1860 
ggacgtgcag acaggctggc attctagatt acttttctta ctctgaaaca tggcctgttt 1920 

55 gggagtgcgg gattcaaagg tggtcccacc gctgccccta ctgcaaatgg cagttttaat 1980 
cttatctttt ggcttctgca gatggttgca attgatcctt aaccaataat ggtcagtcct 2040 
catctctgtc ctgcttcata ggtgccacct tgtgtgttta aagaagggaa gctttgtacc 2100 
tttagagtgt aggtgaaatg aatgaatggc ttggagtgca ctgagaacag catatgattt 2160 
cttgctttgg ggaaaaagaa tgatgctatg aaattggtgg gtggtgtatt tgagaagata 2220 
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atcattgctt atgtcaaatg gagctgaatt 
gctgggcaag tttacttttt ttctgatgtt 



-45- 

tgataaaaac ccaaaataca gctatgaagt 2280 
tcctacaact 2320 



5 <210> 7 
<211> 512 
<212> DNA 
<213> Homo sapiens 

I 

10 <300> 

<308> 6ENBANK M84122 

<400> 7 

tttgcatctc aaggataatt ctgtaagcag 
15 gagaggaaaa tgctgtaaaa cttacatatt 
tgcctttaat catagtctta agaggaagat 
tcaaagaata tgagaaaaaa ttaactgaaa 
ctacattaat gcttctggag aacatggaat 
agaaattcca gagaagaaag ctatggcaat 
20 agtaagaaga ttctatacca tggcctcata 
ttccagcttc cagattaatt ctcttaattg 
tcccaactat taatccaaag gttttttttg 



gaacccttcc tcctttagaa ggaagtaaag 60 
aataattttt tactctatct caaacacgca 120 
atctaattca taacttactg tatgtagtca 180 
atttttcttc tggctctagg aatttgaagc 240 
tgtggttttc tctttgggat caatggtctc 300 
tgctgatgct ttgggcaaaa tccctcagac 360 
tctattttca caggagcgct aatcccagac 420 
gaaccttaga tttggctttt ccctgccact 480 
tt 512 



25 <210> 8 
<211> 784 
<212> DNA 
<213> Homo sapiens 

30 <300> 

<308> GENBANK M84123 

<400> 8 

aaagatgtcc tcaagggacc ctgttttcta 
35 attatgtctt tctttacgtt ctgctctttt 
gaacccgacc atcgaatctt gcgaacaaca 
atctgcttgg tatgttgggc ggattggatg 
atggcttaag cacagctatt ctaaaggatt 
tcctacattg ctttttatct agtggggtat 
40 gcaagggcat gtgagtaaca ctgagtcttt 
ctgtgaaact cagagatgta actgctgaca 
atgacccgtg cctttatcac ccatgctggt 
ggcgttccca tggtgatgat gcccttgttt 
gagactaagg gagctggagt gaccctgaat 
45 aatgctctaa aagcagtcat caatgacaaa 
tggtcatggc attcatgata aaattgtttc 
acgt 



gttagtatag cagatttgtt ttctaatcat 60 
gcccctccca ggtcctgtgg cggtacactg 120 
cgatacttgt taagtggcta ccccaaaacg 180 
tataggtcaa accagggtca aattaagaaa 240 
gttgagcttg aaaatattat ggccaacata 300 
ctcaacccac attttcttct gcaaatttct 360 
ggagtgtttt cagaacctag atgtgtccag 420 
tcctccctat tttgcatctc aggtcacccg 480 
tcccatggtg tttatgaaag catatgcaat 540 
ggtgatcaga tggacaatgc aaagcgcatg 600 
gttctggaaa tgacttctga agatttagaa 660 
aggtaagaaa gaagatacag aagaatactt 720 
aaatatgaaa acatttacgt agcatttaat 780 

784 



50 <210> 9 

<211> 18 

<212> mA 

<213> Homo sapiens 

55 <400> 9 

gcacaccttg tccttcat 18 



<210> 10 
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<211> 18 

<212> DNA 

<213> Homo sapiens 

5 <400> 10 

gcacaccttg tccttcac 18 



<210> 11 « 
10 <211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 11 

15 catcaccatc gagatcaac 19 



<210> 12 
<211> 20 
20 <212> DNA 

<213> Homo sapiens 

<400> 12 

ctattaaatc gcctctctct 20 

25 

<210> 13 
<211> 20 
<212> DNA 
30 <213> Homo sapiens 

<400>- 13 

ctattaaatc gcctctctcc 20 



35 



40 



<210> 14 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 14 

ggatgaattt caagtattt 19 



45 <210> 15 
<211> 18 
<212> DNA 

<213> Homo sapiens 
50 <400> 15 

agcccagata ccccaaag 18 



<210> 16 
55 <211> 18 
<212> DNA 

<213> Homo sapiens 



<400> 16 
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agcccagata ccccaaaa 



18 



<210> 17 
5 <211> 18 
<212> DNA 
<213> Homo sapiens 

<400> 17 * 
10 tgtgctttct cgtcttca 18 



<210> 18 
<211> 18 
15 <212> DNA 

<213> Homo sapiens- 

<400> 18 

tgtaattcct gccgtcag 18 



<210> 19 
<211> 18 
<212> DNA 
25 <213> Homo sapiens 

<400> 19 

tgtaattcct gccgtcaa 18 



<210> 20 

<211> 17 

<212> DNA 

<213> Homo sapiens 

35 

<400> 20 

atacagcact ggcatgg 17 



40 <210> 21 
<211> 18 
<212> DNA 
<213> Homo sapiens 

45 <400> 21 

cctggagttt gtggggcg 



<210> 22 
50 <211> 18 
<212> DNA 
<213> Homo sapiens 



20 



30 



55 



<400> 22 

cctggagttt gtggggca 



18 



<210> 23 
<211> 18 



wo 02/06523 



PCT/EPOl/07524 



-48- 



<212> DNA 

<213> Homo sapiens 



<400> 23 
5 tgaaccatga agtccacg 



18 



<210> 24 
<211> 18 



10 <212> DNA 

<213> Homo sapiens 



<400> 24 

cgtgttccct ggagcata 



18 



15 



<210> 25 
<211> 18 
<212> DNA 
20 <213> Homo sapiens 

<400> 25 

cgtgttccct ggagcatg 18 



<210> 26 

<211> 20 

<212> DNA 

<213> Homo sapiens 

30 

<400> 26 

gaatgtagga cacagggtct 20 



35 <210> 27 
<211> 18 
<212> DNA 
<213> Homo sapiens 

40 <400> 27 

gacacagggt ctgggctt 18 



<210> 28 
45 <211> 18 
<212> DNA 
<213> Homo sapiens 



25 



<400> 28 
50 gacacagggt ctgggctg 



18 



<210> 29 
<211> 19 



55 <212> DNA 

<213> Homo sapiens 



<400> 29 

tacctcttca ggggttttc 



19 
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<210> 30 
<211> 16 
5 <2X2> DNA 

<213> Homo sapiens 



10 



<400> 30 

tgcagtaggg gcagcg ^ 16 



<210> 31 
<211> 16 
<212> DNA 
15 <213> Homo sapiens 

<400> 31 

tgcagtaggg gcagcc 16 

20 

<210> 32 

<211> 16 

<212> DNA 

<213> Homo sapiens 

25 

<400> 32 

ggagtgcggg attcaa 16 



30 <210> 33 
<211> 20 
<212> DNA 
<213> Hcxao sapiens 

35 <400> 33 

acacggccat agttggttca 20 



<210> 34 
40 <211> 20 
<212> DNA 
<213> Homo sapiens 

<400> 34 

45 cagttgatga agtacaggcc 20 



<210> 35 
<211> 20 
50 <212> DNA 

<213> Homo sapiens 



55 



<400> 35 

tgtagtggtg gtgcctgaag 20 



<210> 36 
<211> 21 
<212> DNA 
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<213> Homo sapiens 
<400> 36 

gacagctgat gcgagttctt c 

5 

<210> 37 
<211> 20 

<212> DNA I 
10 <213> Homo sapiens 

<400> 37 

gagggcaggt tctatcgtac 

15 

<210> 38 

<211> 20 

<212> DNA 

<213> Homo sapiens 

20 

<400> 38 

gggcactgtg caccttcttc 



-50- 



21 



20 



25 <210> 39 
<211> 20 
<212> DNA 
<213> Homo sapiens 

30 <400> 39 

acggcaccat tgcgaagtgc 20 



<210> 40 
35 <211> 23 
<212> DNA 
<213> Homo sapiens 

<400> 40 

40 acttacatat caacaagtgc tgc 23 



<210> 41 
<211> 20 
45 <212> DNA 

<213> Homo sapiens 

<400> 41 

gggcatgatc tgtccaaggc 20 

50 

<210> 42 
<211> 22 
<212> DNA 
55 <213> Homo sapiens 



<400> 42 

ggttgagtaa gtcttcactg tg 



22 
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10 



15 



-51 



<210> 43 
<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 43 

ctcagctgca gttctctg 18 



<210> 44 

<211> 21 

<212> mA 

<213> Homo sapiens 

<400> 44 

ccagatcctc cagggtatat g 21 



20 <210> 45 
<211> 20 
<212> DNA 
<213> Homo sapiens 

25 <400> 45 

gagttcatcc agtggttttc 20 



<210> 46 
30 <211> 18 
<212> DNA 
<213> Homo sapiens 

<400> 46 

35 cagttcagta aggaatcc 18 



<210> 47 
<211> 22 
40 <212> DNA 

<213> Homo sapiens 



45 



55 



<400> 47 

caaggataat tctgtaagca gg 22 



<210> 48 
<211> 22 
<212> DNA 
50 <213> Homo sapiens 

<4a0> 48 

ggattaatag ttgggaagtg gc 22 



<210> 49 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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10 
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<400> 49 

ggccaacata tcctacattg 20 



<210> 50 
<211> 23 
<212> DNA 

<213> Homo sapiens * 
<400> 50 

cgtattaaat gctacgtaaa tgt 23 



15 <210> 51 
<211> 20 
<212> DNA 
<213> Homo sapiens 

20 <400> 51 

cagttagcca tgcttgtgcc 20 



<210> 52 
25 <211> 20 
<212> DNA 
<213> Homo sapiens 

<400> 52 

30 gcactctggg gctgattaat 20 



<210> 53 
<211> 19 
35 <212> DNA 

<213> Homo sapiens 

<400> 53 

cgtgctgaca gtggccttc 19 

40 

<210> 54 
<211> 20 
<212> DNA 
45 <213> Homo sapiens 

<400> 54 

cagtgcactc caagccattc 20 

50 

<210> 55 

<211> 18 

<212> DNA 

<213> Homo sapiens 

55 

<400> 55 

gatggttgca attgatcc 18 
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<210> 56 

<211> 20 

<212> DNA 

<213> Homo sapiens 

5 

<400> 56 

ttagttgtag gaaacatcag 20 
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Claims 

1. A method for detecting a predisposition to a hepatototoxic reaction of a human 
being caused by the administration of a pharmaceutically active compound based on the 
determination of at least one single nucleotide polymorphism in the UDP- 

5 glucuronosyltransferase (UGTl) gene ifi a sample of said human being, which method 
comprises 

determining the nucleotide at position 908 in exon 5 of the UGTl gene as defined by the 
position in SEQ ID NO:l, and/or 

determining the nucleotide at position 528 in exon 1 of the UGT1A6 gene as defined by the 
10 position in SEQ ID NO:2, and/or 

determining the nucleotide at position 197 in exon 1 of the UGT1A7 gene as defined by the 
position in SEQ ID NO:3, and 
determining the status of the human being, 

2. A method for detecting a predisposition to a hepatototoxic reaction of a human 
15 being caused by the adnndnistration of a pharmaceutically active compound based on the 

determination of at least one single nucleotide polymorphism in the UDP- 
glucuronosyltransferase (UGTl) gene in a sample of said human being, which method 
comprises determining the nucleotide at position 908 in exon 5 of the UGTl gene as 
defined by the position in SEQ ID N0:1 and determining the status of the human being. 

20 3. The method according to any one of daims 1 or 2, wherein additionally the 

polymorphism at one or more of the following positions is determined: 
position 232 in exon 1 of UGT1A6 as defined by the position in SEQ ID NO:2, or 
position 754 in exon 1 of UGT1A6 as defined by the position in SEQ ID NO:2, or 
position 765 in exon 1 of UGT1A6 as defined by the position in SEQ ID NO:2, or 

25 position 551 in exon 1 of UGT1A7 as defined by the position in SEQ ID NO:3> or 
position 555 in exon 1 of UGTl A7 as defined by the position in SEQ ID NO:3, or 
position 556 in exon 1 of UGT1A7 as defined by the position in SEQ ID NO:3, or 
position 786 in exon 1 of UGT1A7 as defined by the position in SEQ ID NO:3. 

4. A method according to any one of claims 1 to 3, in which the single nucleotide 
30 polymorphism at position 908 in exon 5 of the UGTl gene locus consists of the presence of 
a C or a G, the single nucleotide polymorphism at position 528 in exon 1 of UGT1A6 
consists of the presence of a G or an A, the isingle nucleotide polymorphism at position 197 
in exon 1 of UGT1A7 consists of the presence of a G or a C, the single nucleotide 
polymorphism at position 232 in exon 1 of UGT1A6 consists of the presence of a G or a T> 
35 the single nucleotide polymorphism at position 754 in exon 1 of UGT1A6 consists of the - 
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presence of an A or a the single nucleotide polymorphism at position 765 in exon 1 of 
UGT1A6 consists of the presence of an A or a C, the single nucleotide polymorphism at 
position 551 in exon 1 of UGT1A7 consists of the presence of an G or a T, the single 
nucleotide polymorphism at position 555 in exon 1 of UGT1A7 consists of the presence of 
5 an A or a Q the single nucleotide polymorphism at position 556 in exon 1 of UGT1A7 
consists of the presence of an A or a C,*and the single nucleotide polymorphism at position 
786 in exon 1 of UGT1A7 consists of the presence of a C or a T. 

5. A method as daimed in claims 1 to 4> wherein the region containing the potential 
polymorphism is amplified, preferably by polymerase chain reaction, prior to determining 

10 the sequence. 

6. A diagnostic nucleic add comprising the folloMong polymorphism containing 
sequences: 

the nudeic acid sequence of SEQ ID NO:l with C at position 908 in exon 5 of UGTl as 
defined by the position in SEQ ID NO:l; . 
15 the nudeic add sequence of SEQ ID NO:l with G at position 908 in exon 5 of UGTl as 
defined by the position in SEQ ID NO:l; 

the nudeic add sequence of SEQ ID NO:2 with G at position 232 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID N0:2; 

the nucldc add sequence of SEQ ID NO:2 with T at position 232 in exon 1 of UGT1A6 as 
20 defined by the position in SEQ ID N0:2; 

the nucldc add sequence of SEQ ID N0:2 with G at position 528 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID N0:2; 

the nucleic acid sequence of SEQ ID NO:2 with A at position 528 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID NO:2; 
25 the nucleic add sequence of SEQ ID NO:2 with G at position 754 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID N0:2; 

the nudeic add sequence of SEQ ID N0:2 with A at position 754 in exon 1 of UGT1A6 as 
defined by tibe position in SEQ ID NO:2; 

the nudeic add sequence of SEQ ID NO:2 with C at position 765 in exon 1 of UGTl A6 as 
30 defined by the position in SEQ ID N0:2; 

the nucleic add sequence of SEQ IDNO:2 with A at position 765 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID N0:2; 

the nudeic add sequence of SEQ ID NO:3 with G at position 197 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID N0:3; 
35 the nudeic add sequence of SEQ ID NO:3 with C at position 197 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID N0:3; 
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the nucleic acid sequence of SEQ ID Nb:3 with G at position 551 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID N0:3; 

the nudeic add sequence of SEQ ID NO:3 with T at position 551 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 
5 the nucleic add sequence of SEQ ID NO:3 with A at position 555 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:5; 

the nucleic add sequence of SEQ ID NO:3 with C at position 555 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 

the nucleic acid sequence of SEQ ID NO:3 with A at position 556 in exon 1 of UGTl A7 as 
10 defined by the position in SEQ ID NO:3; 

the nucleic add sequence of SEQ ID NO:3 with G at position 556 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 

the nudeic add sequence of SEQ ID NO:3 with C at position 786 in exon 1 of UGTl A7 as 
defined by the position in SEQ ID N0:3; 
15 the nuddc add sequence of SEQ ID NO:3 with T at position 786 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID N0:3; or 

a complementary strand thereof or a fi:agment thereof of at least 20 bases comprising at 
least one of the polymorphisms. 

7. A diagnostic nudeic add sdected from the group consisting of 

20 the nudeic add sequence of SEQ ID NO:l with C at position 908 in exon 5 as defined by 
the position in SEQ ID NO:l; 

the nudeic add sequence of SEQ ID NO:2 with G at position 528 in exon 1 of UGTl A6 as 
defined by the position in SEQ ID NO:2; or 

the nudeic add sequence of SEQ ID NO:3 with G at position 197 in exon 1 of UGT1A7 as 
25 defined by the position in SEQ ID N0:3. 

8. A diagnostic nudeic add selected from the group consisting of 

the nucleic add sequence of SEQ ID NO:l witii G at position 908 in exon 5 of UGTl as 
defined by the position in SEQ ID NO:l; 

the nudeic add sequence of SEQ ID NO:2 with G at position 232 in exon 1 of UGT1A6 as 
30 defined by the position in SEQ ID NO:2; 

the nucleic acid sequence of SEQ ID NO:2 with T at position 232 in exon 1 of UGTIA6 as 
defined by the position in SEQ ID NO:2; 

the nucleic acid sequence of SEQ ID NO:2 with A at position 528 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID N0:2; 
35 the nucleic add sequence of SEQ ID NO:2 with G at position 754 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID NO:2; 
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Ae nudeic add sequence of SEQ ID NO;2 with A at position 754 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID NO:2; 

the nudeic add sequence of SEQ ID NO:2 with C at position 765 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID NO:2; 
5 the nucleic acid sequence of SEQ ID NO:2 with A at position 765 in exon 1 of UGT1A6 as 
defined by the position in SEQ ID NOfi; 

the nudeic acid sequence of SEQ ID NO:3 with C at position 197 in exon 1 of UGT1A7 as 
defined by the position m SEQ ID NO:3; 

the nudeic add sequence of SEQ ID NO:3 with G at position 551 in exon 1 of UGT1A7 as 
10 defined by the position in SEQ ID NO:3; 

the nudeic add sequence of SEQ ID NO:3 with T at position 551 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 

the nucleic add sequence of SEQ ID NO:3 with A at position 555 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 
15 the nucleic add sequence of SEQ ID NO:3 with C at position 555 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 

the nudeic add sequence of SEQ ID NO:3 with A at position 556 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 

the nudeic acid sequence of SEQ ID NO;3 with G at position 556 in exon 1 of UGT1A7 as 
20 defined by the position in SEQ ID NO:3; 

the nudeic acid sequence of SEQ ID NO:3 with C at position 786 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; or 

the nudeic acid sequence of SEQ ID NO:3 with T at position 786 in exon 1 of UGT1A7 as 
defined by the position in SEQ ID NO:3; 
25 whenever used in combination with any one of the diagnostic nuddc adds as daimed in 
claim 7. 

9. A set of diagnostic nucleic adds comprising the following polymorphism 
containing sequences: 

the nudeic add sequence of SEQ ID NO:l with C at position 908 in exon 5 as defined by 
30 the position in SEQ ID NO:l; 

the nucleic add sequence of SEQ ID NO:l with G at position 908 in exon 5 as defined by . 
the position in SEQ ID NO:l; 

the nudeic acid sequence of SEQ ID NO:2 with G at position 754 in exon 1 as defined by 
the position in SEQ ID NO:2; 
35 the nudeic acid sequence of SEQ ID NO:2 with A at position 754 in exon 1 as defined by 
the position in SEQ ID NO:2; 

the nudeic acid sequence of SEQ ID NO:2 with C at position 765 in exon 1 as defined by 
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the position in SEQ ID NO:2; 

the nucleic acid sequence of SEQ ID NO:2 with A at position 765 in exon 1 as defined by 
the position m SEQ ID NO:2; or 

a complementary strand thereof or a fragment thereof of at least 20 bases comprising at 
least one of the polymorphisms. 

f 

10. A diagnostic nudeic add primer for detecting a polymorphism in the UGTl gene 
capable of hybridizing spedfically to a nucleic add having one of ihe polymorphisms as 
defined in claim 4. 

1 L A diagnostic nudeic acid primer as daimed in claim 10 which is an allele-spedfic 
nudeic acid primer having a sequence selected from the group consisting of: 
the nucleic acid sequence as defined by SEQ ID NO:24; 
tiie nudeic add sequence as defined by SEQ ID NO:25; 
the nudeic add sequence as defined by SEQ ID NO:27; 
the nucleic acid sequence as defined by SEQ ID N028; 
the nudeic add sequence as defined by SEQ ID NO:30; or 
the nucleic add sequence as defined by SEQ ID NO:31. 

12. An allde-spedfic oligonudeotide probe for detecting a polymorphism in the 
UGTl gene capable of hybridizing specifically to a nudeic acid having one of the 
polymorphisms as defined in daim 4. 

13. A diagnostic kit comprising one or more diagnostic primer(s) as defined in daim 
10 and/or one or more aUde-spedfic oligonudeotide probes(s) as defined in daim 12. 

14. A pharmaceutical pack comprising Tolcapone ^nd instructions for 
administration of the drug to humans diagnosticaUy tested for a single nudeotide 
polymorphism according to a method as daimed in any one of daims 1 to 5. 

15. A computer readable medixmi having stored thereon sequence information for 
the polymorphisms in UGTl at position 908 in exon 5 of the UGTl gene locus as defined 
by the position in SEQ ID NO:l and/or at position 528 in exon 1 of the UGT1A6 gene as 
defined by tiie position in SEQ ID NO:2 and/or at position 197 in exon 1 of the UGT1A7 
gene as defined by the position in SEQ ID NO:3. 

16. A method for performing sequence identification, said method comprising the 
steps of providing a diagnostic nudeic acid sequence as claimed in any one of daims 6 to 9 
and comparing said diagnostic nucleic acid sequence to at least one other nudeic add or 
polypeptde sequence to identify identity. 
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. 17. A method as hereinbefore described. 
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